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It  is  the  purpose  of  this  thesis  to  study  graph-theoretic  problems 
that  arise  in  certain  practical  situations.  These  problems  deal  with  classes 
of  ordered  trees,  with  classes  of  undirected  graphs  appearing  in  scheduling 
problems,  and  with  algorithms  that  label  free  trees. 

First  we  study  problems  concerning  ordered  trees.  We  show  how  to 
generate  trees  in  certain  classes  in  order,  how  to  determine  the  position  of 
a  given  tree  and  how  to  find  a  tree  given  its  position.  We  also  show  several 
enumeration  results  for  the  class  of  ordered  trees  with  n  edges.    Next  we 
investigate  a  class  of  undirected  graphs  that  arise  in  certain  scheduling 
problems.  This  class  is  fully  characterized,  and  several  of  its  extensions 
are  studied.    Last  we  study  algorithms  that  label  edges  of  a  given  tree 
with  given  labels,  optimizing  certain  objective  functions.  For  some  of  these 
functions  we  show  polynomial -time  algorithms,  and  for  others  the  problem  is 
shown  to  be  NP-complete. 
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CHAPTER  1 
INTRODUCTION 

1.1  AN  OVERVIEW 

It  is  the  purpose  of  this  thesis  to  study  graph-theoretic  problems 
which  arise  in  certain  practical  situations.  These  problems  deal  with  classes 
of  ordered  trees,  with  classes  of  undirected  graphs  appearing  in  scheduling 
problems,  and  with  algorithms  that  label  free  trees. 

First  we  study  problems  concerning  ordered  trees.  For  example, 
suppose  we  have  an  algorithm  that  operates  on  a  certain  class  S  of  ordered 
trees,  and  we  want  to  measure  its  performance  empirically.  For  this  we  might 
need  to  run  the  algorithm  on  all  the  trees  in  S,  or  (in  case  S  contains  too 
many  trees)  to  run  it  on  a  random  sampling  of  trees  from  S.  In  both  cases 
we  first  have  to  define  a  linear  ordering  in  S,  and  then  either  generate  all 
the  trees,  one  by  one,  according  to  this  ordering,  or  to  generate  the  r-th 
tree  in  S  for  some  random  number  r.  As  another  example,  suppose  we  deal  with 
a  function  f,  defined  on  trees  in  S,  and  suppose  f  depends  on  some  property  P 
of  the  tree  (say,  the  number  of  leaves).  To  compute  the  average  of  f  over  S  we 
need  to  study  the  average  of  P  over  S  (the  expected  number  of  leaves  in  the 
above  example).  These  problems  are  studied  in  chapters  2,  3  and  4.  The 
first  problem  is  studied  when  S  is  the  set  of  regular  binary  trees  with  n 
internal  nodes,  the  set  of  k-ary  trees  with  n  internal  nodes,  and  the  set  of 
ordered  trees  with  n^  internal  nodes  of  degree  k . ,  i=l,2,...t  for  some  t;   all 
of  this  is  treated  in  chapters  3  and  4.  The  second  problem  is  studied  in 
chapter  2,  where  S  is  the  set  of  ordered  trees  with  n  edges.  The  discussion  of 


this  is  based  on  [Zaks,  1977a],  [Zaks,  1977b],  [Zaks  and  Richards,  1977] 
and  [Dershowitz  and  Zaks,  1979]. 

Next  we  study  classes  of  undirected  graphs  which  arise  in  certain 

scheduling  problems.  Suppose  we  have  n  tasks  t-. ,  t« t   to  be 

executed  on  two  identical  processors  that  share  the  same  memory,  and  that 

task  t.  requires  a  fraction  c.  of  the  memory  for  its  execution.  Two  distinct 

tasks  t.  and  t.  can  be  executed  simultaneously  if  their  total  memory 

requirement  does  not  exceed  the  available  amount  of  memory,  namely  if 

Cj  +  Cj  <  T.  This  situation  can  be  described  by  a  graph  in  which  the  vertex 

n.  is  associated  with  task  t. ,  and  there  is  an  edge  between  n.  and  n.  if 

c.  +  c.  <.  1,  where  c  is  the  label  of  n . .  The  class  of  graphs  that  can  be 

obtained  in  this  way  is  investigated.  Extending  the  scheduling  problem  to 

the  case  when  the  two  processors  share  more  than  one  resource  gives  rise 

to  new  classes  of  graphs.  All  of  this  is  the  subject  of  chapter  5,  based  on 

[Ecker  and  Zaks,  19771. 

Last  we  study  algorithms  that  label  edges  of  free  trees.  Suppose  we 

have  n+1  users  and  a  tree-like  communication  network  (with  n  connections) 

connecting  them,  and  we  are  given  n  communication  lines  of  different  'weights' 

W-, ,  w2  ...  ,  w  .  Assign  the  weights  to  the  edges,  and  define  the  communication 
cost  c(i,j)  between  users  i  and  j  as  the  sum  of  the  weights  on  the  (unique) 
path  connecting  them.  We  want  to  determine  an  assignment  that  minimizes  (or 
maximizes)  the  largest  of  the  c(i,j)'s,  or  one  that  minimizes  (or  maximizes) 
the  average  of  the  c(i,j)'s.  These  problems,  and  other  related  ones,  are 
discussed  in  chapter  6,  based  on  [Zaks  and  Perl,  1978]. 


1.2  SUMMARY  OF  RESULTS 

1.  Ordered  trees  (chapters  2-4) 

We  show  one-to-one  correspondences  among  the  regular  binary  trees 
with  n  internal  nodes,  the  ordered  trees  with  n  edges,  the  0,1  sequences  of 
n  l's  and  n  O's  in  which  the  number  of  l's  in  each  prefix  is  not  smaller 
than  the  number  of  O's,  the  lattice  paths  from  (0,0)  to  (n,n)  which  do  not 
go  below  the  diagonal  y=x,  and  other  classes  of  combinatorial  objects. 
Linear   ordering  of  trees  is  discussed,  and  relations  to  lexicographic 
ordering  of  the  corresponding  sequences  are  studied. 

Using  these  correspondences  we  find  several  enumeration  results 

concerning  the  class  of  ordered  trees  with  n  edges.  For  example,  we  show 

r  /2n-r-l\ 
that  -•!   ,   J  of  these  trees  have  root  degree  r,  that  the  expected  root 

degree  for  this  class  is  ;rp5->  and  that  the  number  of  nodes  of  deqree  d  in 

this  class  is  f  "","  J.  All  of  this  is  the  subject  of  chapter  2. 

In  chapter  3  we  develop  an  algorithm  that  generates  lexicographically 

sequences  corresponding  to  regular  binary  trees  with  n  internal  nodes,  we  show 

how  to  determine  the  position  of  a  sequence  and  how  to  determine  the  sequence 

given  its  position  in  this  ordering.  The  generating  algorithm  is  then 

generalized  to  k-ary  trees,  and  is  compared  to  existing  related  algorithms. 

The  complexity  of  the  generating  algorithm  is  measured  by  the  number  of 

comparisons  it  makes.  We  show  that  the  algorithm,  that  generates  sequences 

corresponding  to  k-ary  trees  with  n  internal  nodes,  uses,  on  the  average, 

less  than  two  comparisons  per  sequence  (independent  of  n  and  k). 

In  chapter  4  these  results  are  extended  as  follows:  First  the 
generating,  ranking  and  unranking  algorithms  are  studied  for  k-ary  trees, 
using  a  different  ordering  of  these  sequences,  and  then  all  the  discussion  of 


chapter  3  (generating,  ranking  and  unranking  algorithms  for  binary  trees)  is 
extended  to  the  classes  of  ordered  trees  with  n.  internal  nodes  of  degree  k., 
i=l  ,2,. . . ,t  for  some  t. 

2.  A  graph  labeling  problem  (chapter  5) 

k 
The  classes  GR  ,  k>0,  of  undirected  graphs  are  defined  as  follows: 

k 
GR  contains  all  the  graphs  G=(V,E)  for  which  there  exists  a  labeling  of  the 

i/ 

vertices  in  V  with  vectors  in  [0,1]   such  that  there  is  an  edge  connecting 

two  vertices  if  and  only  if  the  sum  of  their  labels  does  not  exceed  (1, !,...,!). 
Such  a  labeling  is  called  a  k-labeling. 

The  class  GR  is  fully  characterized,  and  is  shown  to  coincide  with 

n  1 

the  class  of  threshold  graphs.  It  is  shown  that  there  are  2  "   non- 
isomorphic  graphs  in  GR  with  n  vertices,  and  the  proof  provides  a  simple  and 
elegant  way  to  generate  all  those  graphs  and,  given  a  graph  G,  to  decide 

whether  it  is  in  GR  . 

k 
The  classes  GR  ,  k>l,  are  then  studied  and  several  results  and 

examples  are  explored.  It  is  shown  that  GR1  is  properly  included  in  GR 

for  each  i.  It  is  known  that,  given  a  graph  G  and  an  integer  k,  the  problem 

k 
of  determining  whether  G  is  in  GR   is  NP-complete.  Therefore  it  seems  that 

k 

a  simple  characterization  of  the  classes  GR  ,  k>l ,  is  not  likely  to  exist  . 

For  example,  although  the  class  GR  can  be  characterized  by  a  finite  number  of 

2 

forbidden  induced  subgraphs,  this  is  not  the  case  for  GR  . 

3.  Optimal  label ings  for  trees  (chapter  6) 

Given  a  free  tree  T  with  n  edges  and  a  set  W  of  n  weights,  we  study 
algorithms  for  labeling  the  edges  of  T  with  weights  from  W  that  optimize  certain 
objective  functions.  Some  of  these  problems  are  shown  to  be  NP-complete,  and 
for  others  we  present  polynomial -time  algorithms. 


For  example,  we  prove  that  determining  whether  a  given  tree  can 
be  labeled  with  a  given  set  of  weights  such  that  its  diameter  will  not  be 
larger  than  a  given  number  is  an  NP-complete  problem;  the  same  problem 
remains  NP-complete  when  'diameter' is  replaced  by 'radius'.  Other  related 
NP-complete  results  are  also  derived. 

The  problem  of  minimizing  or  maximizing  the  average  distance 
between  the  vertices  of  T,  given  T  and  W  as  above,  is  solvable  in  polynomial 
time,  as  well  as  other  related  problems. 


CHAPTER  2 
OWEREV  TREES 

2.1   INTRODUCTION 

Ordered  trees  play  a  key  role  in  theoretical  computer  science. 
For  example,  derivation  trees  are  used  quite  often.  In  the  next,  three 
chapters  we  discuss  several  problems  concerning  these  trees. 

In  certain  practical  situations  we  are  concerned  with  trees  from 
a  certain  class  S,  Suppose  we  have  an  algorithm  that  operates  on  trees  in 
S,  and  we  want  to  measure  its  performance.  For  this  we  might  like  to  run 
the  algorithm  on  all  trees  in  S,  or  (if  S  is  too  large)  to  run  it  on  some 
randomly  chosen  trees.  In  both  cases  we  first  might  want  to  define  a  linear 
ordering  on  S,  and  then  either  generate  all  trees  or  generate  the  r-th  tree, 
for  a  random  number  r,  according  to  this  ordering.  As  another  related 
situation,   suppose  we  are  looking  for  trees  with  given  properties  in  S. 
If  we  don't  have  a  better  way,  we  might  proceed  as  follows:  define  a  linear 
ordering  on  S,  and  then  generate  the  trees  ,  one  by  one,  according  to  this 
ordering,  until  we  either  find  all  the  trees  we  are  looking  for  or  reach 
the  last  tree.  As  yet  another  situation  ,  we  might  like  to  study  certain 
properties  of  a  tree  chosen  randomly  from  S;  for  example,  we  might  want  to 
know  what  is  the  expected  number  of  leaves  in  such  a  tree,  if  we  are 
dealing  with  a  cost  function  that  depends  on  this  quantity. 

In  this  chapter  we  present  ordered  trees  and  discuss  several  one- 
to-one  correspondences  among  certain  classes  of  these  trees,  integer 
sequences,  lattice  paths  and  other  combinatorial  objects.  Most  of  this 


chapter  reviews  results  of  a  combinatorial  flavor.  We  bring  it  here  in  order 
to  keep  the  thesis  self-contained.  In  the  following  two  chapters  we  will 
discuss  the  problem  of  generating  trees. 

We  discuss  one-to-one  correspondences  among  the  regular  binary  trees 
with  n  internal  nodes,  the  ordered  trees  with  n  edges,  the  0,1  sequences  with 
n  l's  and  n  0's  in  which  in  any  prefix  the  number  of  l's  is  not  smaller  than 
the  number  of  0's,  the  set  of  lattice  paths  from  (0,0)  to  (n,n)  which  do  not 
go  below  the  diagonal  y=x,  and  other  combinatorial  objects.  Linear  orderings 
of  trees  are  discussed,  and  the  connection  between  them  and  the  lexicographic 
ordering  of  the  corresponding  integer  sequences  are  investigated.  We  then 
study  the  Cycle  Lemma,  and  use  it  to  enumerate  certain  lattice  paths.  At  the 
end  of  the  chapter  we  give  enumeration  results  concerning  the  class  of  trees 

with  n  edges.  For  example,  we  show  the  following:  the  number  of  such  trees 

r  /2n-r-l\ 
which  have  a  root  of  degree  r  is  -I   ,   J,  the  expected  root  degree  in 

this  class  is  — ^5-  ,  the  total  number  of  nodes  of  degree  d  in  these  trees  is 

(  ~,~  J.  We  also  present  two  new  proofs  for  the  number  —  f !? Vf .  "-.  J  of  trees 

with  n  edges  and  k  leaves. 

Basic  notions  are  given  in  section  2.2.  A  detailed  discussion  of 
the  one-to-one  correspondences  is  the  subject  of  section  2.3.  Section  2.4 
deals  with  the  ordering  of  trees,  and  section  2.5  discusses  the  Cycle  Lemma 
and  the  enumeration  of  lattice  paths.  The  enumeration  results  about  ordered 
trees  are  presented  in   section  2.6.  As  mentioned  before,  most  of  this 
chapter  is  a  review;  several  references  on  these  subjects  are  cited  in 
section  2.7. 


2.2  PRELIMINARIES 

We  deal  with  the  combinatorial  structures  known  as  ordered  trees. 
We  state  here  definitions,  notations  and  properties  that  we  will  use  in  the 
sequel.  Our  terminologies  mainly  follow  [Knuth,  1968]  and  [Liu,  1977]. 

An  ordered  tree  has  a  distinguished  node  r,  called  the  root  of  the 
tree,  and  m  subtrees  t, ,  t«,  ...  ,  t  ,  m>0,  of  the  root  r,  where  each  t. 
is  also  an  ordered  tree.  It  should  be  emphasized  that  the  relative  order  of 
the  subtrees  is  important.  The  roots  of  the  subtrees  are  sons  of  the  root  r, 
and  r  is  their  father.  The  degree  of  a  node  in  a  tree  is  the  number  of  its 
sons,  and  it  resides  on  level  £,  where  i   is  its  distance  (number  of  edges 
seperating  it)  from  the  root.  The  height  of  a  tree  is  the  maximal  level  of 
any  of  its  nodes.  A  node  of  degree  0  is  called  a  leaf,  otherwise  it  is  an 
internal  node.  A  k-ary  tree  is  an  ordered  tree  in  which  each^node  has  at 
most  k  sons;  in  a  binary  tree  (k=2)  each  node  has  0,  1  or  2  sons.  A  k-ary 
tree  is  called  regular  if  each  internal  node  has  exactly  k  sons;  in  a  . 
regular  binary  tree  each  internal  node  has  exactly  two  sons,  a  left  son 
and  a  right  son.  Figure  2.1  presents  an  ordered  tree  with  4  internal  nodes 
and  5  leaves;  it  is  a  non-regular  ternary  (k=3)  tree  of  height  3.  The  root 
a  has  degree  3,  and  the  rest  of  the  internal  nodes  b,e  and  f  have  degrees 
2,  2  and  1,  respectively.  The  leaves  are  c,d,g,h  and  i,  and  the  levels  of 
the  nodes  are  shown  in  that  figure. 

T(k,n)  denotes  the  set  of  regular  k-ary  trees  with  n  internal 
nodes,  and  t(k,n)  is  the  number  of  these  trees.  Bn=T(2,n)  and  bn=t(2,n) 
stand  for  the  corresponding  entities  in  the  binary  case.  Tn  denotes  the 
set  of  ordered  trees  with  n  edges.  The  regular  binary  trees  with  n<3 
internal  nodes,  and  the  ordered  trees  with  n<3  edges  are  shown  in  figures 
2.2  and  2.3,  respectively;  these  trees  for  n=4  are  shown  in  section  3.8. 


level  0 


level  1 


level  2 


level  3 


An  ordered  tree  with  eight  edges 
Figure  2.1 


42,... 


The  sizes  of  the  sets  of  trees  T  ,  or  B  ,  form  a  series  1,1,2,5,14, 

These  are  the  well-known  Catalan  numbers  C  : 

n 


C     =     B       =     T 
n       I    nl        I  nl 


1     /2n 

n+1  \  n 


There  are  many  sets  of  combinatorial  objects  whose  cardinality  is  C   ,  and  some 
of  them  are  mentioned  in  section  2.3.     It  is  also  known  that  the   number  of 
regular  k-ary  trees  with  n  internal   nodes,  t(k,n),  is  given  by 


'<"•">  ■  tfWC")  • 


of  which  the  Catalan  numbers  are  a  special  case  (see  [Knuth,  1968,  p.  584]) 
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Regular  binary  trees  with  n<3  internal  nodes 
Figure  2.2 
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Ordered  trees  with  n<3  edges 
Figure  2.3 


12 


The  following  sequence  a(T),  associated  with  a  tree  T,  will  be 
often  used:  label  each  node  with  its  degree,  and  read  these  labels  in 
preorder  (root-left-right)i  om1t|*a§ptfeeilaSfioQfc  iAn  example  is  shown  in 
figure  2.4, 


a(T)   =     40203000  0  1 


An  ordered  tree  T  and  the  corresponding 
sequence  a(T) 
Figure  2.4 
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2.3  CORRESPONDENCES 


We  present  now  several  one-to-one  correspondences  between  certain 
sets  of  ordered  trees,  lattice  paths,  integer  sequences  and  parenthetic 
expressions..  A  list  of  these  sets  follows: 


B 


B' 
n 


the  set  of  regular  binary  trees  with  n  internal  nodes, 
the  set  of  binary  trees  with  n  nodes, 
the  set  of  ordered  trees  with  n  edges. 

the  set  of  0,1  sequences  with  n  l's  and  n  0's,  such  that  in  each 
prefix  the  number  of  l's  is  not  smaller  than  the  number  of  0's; 
this  property  of  a  0,1  sequence  is  called  the  dominating  property, 
the  set  of  integer  sequences  z={z.}?  ,  such  that  0  <  z,  <  z~"% 
<   z. 


and  z.  <_  21 — 1  for  each  i 


Q 


n  and 


the  set  of  integer  sequences  a={a.}1  such  that  £  a. 
i  j=l  J 

z  a.  >  i  for  each  i.  Note  that  A  is  a  subset  of  the  set  of  al 

partitions  (p1 ,  p2,  ...  ,  p  )  of  an  integer  n  (where  each 

p.  ^  0  and  z  p.  =  n). 

i 
the  set  of  lattice  paths  from  the  point  (0,0)  to  (n,n),  which  do 

not  pass  below  the  diagonal  y=x  (in  a  lattice  path  all  steps  are 

either  up  or  to  the  right);  a  lattice  path  is  called  legal  if 

it  does  not  go  below  the  diagonal  y=x. 

the  set  of  parenthetic  expressions  consisting  of  n  open  and  n  close 

parentheses,  in  which  each  open  parenthesis  has  a  matching  close 

parenthesis  after  it;  such  an  expression  is  called  legal . 


We  now  illustrate  these  correspondences  in  a  series  of  figures  and 
short  explanations;  more  references  can  be  found  in  section  2.7. 
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We  start  with  a  tree  T  in  Bg  (figure  2.5) 


T  : 


A  regular  binary  tree  in  Bp 
Figure  2.5 


If  we  remove  all  the  leaves  and  the  edges  incident  with  them,  we 
get  a  tree  T'  in  B'  (figure  2.6).  This  correspondence  between  B  and  B' 
is  one-to-one  (see  [Knuth,  1968,  p. 559]). 

If  we  label  each  internal  node  of  T  with  1  and  each  leaf  with  0, 
and  read  these  labels  in  preorder,  except  forthe  last  leaf,  we  get  a 
sequence  of  8  l's  and  8  0's,   x-(T)  ,  in  Xg.   (See  figure  2.7.)   The 
correspondence  between  B   and  Xn  is  discussed  in  various  references  (see, 
for  example,  [DeBruijn  and  Morsel t,  1967]  or  [Knuth,  1973,  p. 63]). 
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T' 


A  binary  tree  in  Bp 
Figure  2.6 


x(T)  =  1101001100111000 

A  0,1  sequence  in  X0 
Figure  2.7 

If  we  replace  1  and  0  in  x(T)  by  open  and  close  parentheses, 
we  obtain  a  legal  parenthetic  expression  p(T)  in  P«  (figure  2.8).  The  one- 
to-one  correspondence  between  X   and  P   is  immediate. 

n  n 

P(T)   =     [[][]][[]][[[]]] 


A  parenthetic  expression  in  P„ 
Figure  2.R 
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From  the  sequence  x(T)  we  build  the  sequence  z(T),  where  z.  is 
the  position  of  the  i-th  1  in  x(T);  this  sequence  is  in  Zg  (see  figure  2.9). 
The  one-to-one  correspondence  between  X   and  Z   is  discussed  in  [Zaks,  1977a] 
and  follows  from  the  discussion  in  [Knuth,  1973,  p.  63].  When  no  confusion  occurs, 
we  use  x  and  z  for  x(T)  and  z(T),  respectively. 

z(T)  =  1  2  4  7  8  11  12  13 


An  integer  sequence  in  Z» 
Figure  2.9 

This  0,1  sequence  x(T)  can  also  be  obtained  by  traversing  the  ordered 
tree  t(T)  from  Tg,  shown  in  figure  2.10,  when  1  means  'going  down'  and  0  means 
'going  up'.  It  is  clear  that  such  a  sequence  has  the  dominating  property, 
since  in  each  step  the  total  number  of  'going  down's  must  be  not  smaller  than 
the  total  number  of  'going  up's;  it  is  also  clear  that  the  number  of  steps 
down  is  equal  to  the  number  of  steps  up.  That  this  correspondence  is  one-to- 
one  follows  from  the  above  observation  (see,  for  example,  [DeBruijn  and 
Morselt,  1967]). 


t(T)  : 


An  ordered  tree  in 
Figure  2.10 


6 
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Another  way  of  interpreting  this  0,1  sequence  x(T)  is  via  lattice 
paths:  x(T)  corresponds  to  a  lattice  path  l(J)     from  (0,0)  to  (8,8),  when 
1  means  'one  step  up'  and  0  means  'one  step  to  the  right'  (see  figure  2.11) 
This  path  has  the  property  that  it  does  not  pass  below  the  diagonal  y=x;  in 
other  words,  a  (J)   is  in  Lq.  The  one-to-one  correspondence  between  X   and 
L   follows  immediately. 


*(T) 


1 

/'■" 

(8,8) 

► 

(0,0) 


A  lattice  path  in  Lg 
Figure  2.11 
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The  sequence  a(T)  was  previously  defined  for  any  tree  T.  Applying 
it  to  the  tree  t(T),  we  obtain  an  integer  sequence  a(t(T))  in  Afi.  That  this 
correspondence  is  one-to-one  is  the  subject  of  theorem  4.4  in  chapter  4;  see 
also  [Chorneyko  and  Mohanty,  1975].  See  figure  2.12. 

a(t(T))  =  3  2  0  0  10  11 

A  sequence  in  Ap 
Figure  2.12 

This  sequence  a(t(T))=32001011  corresponds  also  to  another  lattice 
path  £'(T):  the  one  in  which  we  go  up  3  steps  on  the  first  column,  then  2 
steps  on  the  second  column,  then  0  steps  on  the  third  column,  etc. 


i'(T)  : 


(0-0) 


Ik 

/  yx 

V 

18,8) 

7 

i 

^ 

Another  lattice  path  in  Lp 
Figure  2.13 
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Since  the  sum  of  the  first  i  elements  in  a(t(T))  is  not  more  than  i,  it 
follows  that  £'(T)  does  not  go  below  the  diagonal  y=x,  hence  it  is  in  Lfi. 
This  correspondence  is  clearly  one-to-one.  (See  figure  2.13  .) 

Notes:  1.  The  correspondence  between  T   and  B'  is,  in  fact,  the  one 

r  n      n 

described  in  [Knuth,  1968,  p.  333]. 

2.  The  correspondence  between  13   and  L  ,  as  discussed  above,  is 
r  n      n 

later  extended  to  a  correspondence  between  T(k,n)  (regular  k-ary 
trees *~m th -  n  internal  nodes)  and  L(k,n)  ('legal'  lattice-paths 
from  (0,0)  to  ((k-l)n.n)  ). 

The  following  properties  follow  directly  from  these  correspondences: 

Lemma  2.1:  Let  T  e  B  .  Then 

n 

(1)  The  number  of  leaves  in  the  tree  t(T) 
=  the  number  of  10  patterns  in  x(T) 

=  the  number  of  corners  (i.e.,  a  path  segment  of  the  form    )  in  z(J) 
=  the  number  of  left  leaves  in  T. 

(2)  The  number  of  internal  nodes  in  the  tree  t(T) 

=  one  more  than  the  number  of  11  (or  equivalently  00)  patterns  in  x(T) 
=  one  more  than  the  number  of  I     (or  equivalently  ■ — •— •)  path  segments 

in  A(T) 
=  the  number  of  right  leaves  in  T. 

(3)  The  number  of  nodes  of  degree  d  in  the  tree  t(T) 
=  the  number  of  occurrences  of  d  in  a(t(T)) 

=  the  number  of  vertical  path  segments  of  length  exactly  d  in  a'(T). 

(4)  The  number  of  nodes  of  degree  d  in  all  the  trees  in  Tn 

=  the  number  of  occurrences  of  runs  of  exactly  d  l's  (or  equivalently 
d  0's)  in  the  sequences  in  X  . 
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The  following  encoding  of  a  tree  will  be  used  in  discussing 
ordering  of  trees  in  section  2.4:  label  each  node  with  the  size  (number  of 
nodes)  of  the  tree  rooted  at  this  node,  and  then  read  these  labels  in  preorder, 
For  T  (figure  2.5)  we  get  the  sequence  s(T),  shown  in  figure  2.14 


s(T)  =  17  5  1  3  1  1  11  3  1  1  7  5  3  1  1  1  1 

A  sequence  of  subtree  sizes 
Figure  2 ._}< 4 

The  following  two  additional  encodings  are  also  used  in  the 
literature:  [Ruskey  and  Hu,  1977]  represent  a  tree  in  B  by  the  sequence 
of  the  levels  of  its  leaves  (see  figure  2,13).  In  [Knuth,  1968,  2.2.1], 
[Trojanowski ,  1977a,b]  and  [Knott,  1977]  the  numbers  l,2,...,n  are  used  to 
label  the  internal  nodes  of  T  in  some  order  (say,  preorder),  and  then  these 
labels  are  read  in  a  different  order  (say,  inorder:  left-root-right).  We  get 
in  this  way  a  permutation  of  the  numbers  l,2,...,n  (known  as  a  stack  permutation; 
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see  also  [Rotem  and  Varol ,  1977])  .  See  example  in  figure  2.16, 


the  sequence: 
233335543 


Ruskey  and  Hu's  level  numbers 
Figure  2.15 


the  permutation: 
2  3  15  4  8  7  6 


A  stack  permutation  (preorder  labeling,  inorder  reading) 

Figure  2.16 
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2.4  LINEAR  ORDERING  OF  TREES 

Our  goal  in  the  next  two  chapters  is  to  generate  trees  in  order. 
In  order  to  define  a  linear  ordering  for  a  certain  class  of  trees,  we  can 
first  establish  an  encoding  of  these  trees  into  certain  sequences,  and  then 
use  the  lexicographic  ordering  of  sequences  to  define  the  ordering  for  the 
trees.  The  lexicographic  ordering  for  integer  sequences  is  defined  as  follows: 
let  u={u.}q  and  v={v. }q  ;  we  say  that  u  <  v  if  uQ  <  vQ  or  if  there  exists 

j,  1  <_  j  <^n,  such  that  u.  =  v.  for  i=0,l j-1  and  u.  <  v..  Regarding 

u  and  v  as  English  words,  u  <  v  means  that  u  precedes  v  in  an  English 
dictionary.  If  instead  of  comparing  u  and  v  from  left  to  right  (starting  with 
uQ  and  vfi)  we  compare  them  from  right  to  left  (starting  with  u  and  v  ),  we 
get  a  different  lexicographic  ordering.  For  obvious  reasons  we  call  this  the 
Hebrew  lexicographic  ordering  (if  one  prefers,  he  might  replace  'Hebrew'  with 
(Arabic^  ...).  For  example,  1256  <  1346  according  to  the  English  ordering, 
but  1256  >  1346  according  to  the  Hebrew  ordering. 


We 


now  define  recursively  a-order  and  a '-order  for  trees;  rT 


denotes  the  degree  of  the  root  of  T,  and  the  T. 's  are  the  subtrees  of  the  root 
Definition  (a-order):  Given  ordered  trees  T  and  T' ,  we  say  that  T  <  T'  if 


(1)  rj  <  rT'   '  or 


(2)  rT  =  r-p,  and  for  some  i,  1  <_  i  <  rT, 


we  have 


(a)  T,  =  Ti  for  j  =  l ,2,. . . ,i-l ,  and 

(b)  Tl  <a  Tj  . 

Definition  (a '-order):  Given  ordered  trees  T  and  T',  we  say  that  T  <1J'  if 

(1)  IT|  <  |  T ■ |   (  |T|  is  the  number  of  nodes  in  T),  or 

(2)  |T |  =  IT'  i  ,  and  for  some-i,  1  ■<,  i  ±  rT, ,  we  have 

(a)  T.  =  Tj  for  j  =  l  ,2, . . . ,i-l ,  and 


(b)  T.  <  T! 
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Both  these  orderings  are  special  cases  of  the  ordering  in  [Knuth, 
1968,  p.  331].  The  relation  between  the  orderings  of  T  and  T'  and  their 
corresponding  sequences  a(T),  a(T'),  s(T)  and  s(T')  is  the  following: 
Ordering  Lemma:  For  ordered  trees  T  and  T' 

(1)  T  <J'   if  and  only  if  s(T)  <  s(T')  , 

(2)  T  <  T'  if  and  only  if  a(T)  <  a(T')  . 

Proof:  (1)  is  immediate  from  the  definition  of  a'-order  and  the  definition  of 
lexicographic  ordering  of  sequences.  The  proof  of  (2)  can  be  found  in  [Zaks 
and  Richards,  1977].  r\ 

See  example  in  figure  2.17. 


T  : 


T' 


s(T) 

=  97151311 

s(T')  = 

=  953111311 

a(T) 

-  22020200 

a(T')  = 

=  222000200 

T  >,T' 

a 

(note:  s(T)  > 

s(T')  ) 

T  <  T' 

a 

(note:  a(T)  < 

a(T')  ) 

Ordering  of  trees 

Figure  2.17 
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A  third  ordering  will  be  used  in  the  first  half  of  chapter  4: 
Definition   (e-order):     Given  ordered  trees  T  and  T',  we  say  that     T  <  T'     if 

(1)  r-j-  =  r-p    ,  and  for  some  i,  1  <  1  <  rv,  we  have 

(a)  T.  =  Tj     for     j=rT>  r.p-1 ,.. . ,rT-1+2,  and 

(b)  Trri+1   I  TrT-i+l      '     or 

(2)  rT  <  rr. 

It  follows  from  the  above  discussion  that  t(T)  <  t(T')  if  and  only  if 
x(T)  <  x(T'),  or  equivalently  z(T)  <  z(T'),  by  the  Hebrew  lexicographic 
ordering,  for  T,  T'  in  B 


2.5  LATTICE  PATHS  AND  THE  CYCLE  LEMMA 

The  following  lemma  turns  out  to  be  very   useful;  it  goes  back  to 
[Motzkin,  1948],  and  has  since  been  rediscovered  a  number  of  times  and 
generalized  (see  [Raney,  1960],  [  Takacs  ,  1967],  [Bergman,  1978]  and 
[Singmaster,  1978]). 

Cycle  Lemma:   Forrany  0,1  sequence  aia2,*,am+n  °^  n  ^'s  and  m  0s'  n  >  m' 
there  exist  exactly  n-m  cyclic  permutations 

ajaj+r--VnaT"aj-l 
that  have  the  strong  dominating  property.   (A  0,1  sequence  is  said  to  have  the 
strong  dominating  property  if  in  each  prefix  the  number  of  l's  is  greater  than 
the  number  of  0's.) 

Proof:  Arrange  the  given  sequence  a-jap.-.a    around  a  circle.  It  is  clear 
that  removing  a  10  pattern  does  not  affect  the  number  of  cyclic  permutations 
that  have  the  strong  dominating  property.  We  can  always  remove  m  such  pairs. 
The  remaining  n-m  l's  are  the  positions  at  which  a  cyclic  permutation  that 
have  this  property  can  start.  O 
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This  proof  is  similar  to  the  one  given  in  [Singmaster,  1978];  for 
more  proofs  see  [Zaks  and  Dershowitz,  1979]. 

For  example,  given  the  sequence  0010110111,  we  arrange  it  around  a 
circle  (see  figure  2.18  ).  After  removing  the  10  pairs  (shown  in  double  arrows) 
we  have  two  l's  left  (pointed  to  by  an  arrow),  and  the  two  cyclic  permutations, 
that  start  at  these  two  l's  and  have  the  strong  dominating  property,  are 
1101110010  and  1110010110. 


Cyclic  permutations 
Figure  2.18 

This  lemma  is  now  used  to  count  the  number  a(i,j)  of  lattice-paths 
from  (0,0)  to  (i,j),  j  >_  i ,  that  do  not  pass  below  the  diagonal  y=x.  (See,  for 
example,  [Yaglom  and  Yaglom,  1964,  problem  83]  . )  As  was  shown  before,  these 
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paths  correspond  to  sequences  of  j  l's  and  i  O's  that  have  the  dominating 
property.  If  we  attach  a  1  in  front  of  each  such  sequence  we  get  a  sequence 
that  has  the  strong  dominating  property.  There  are  I1  •?  ]  possible  linear 
arrangements  of  the  j+1  l's  and  i  O's.  Out  of  the  j+i+1  cyclic  permutations 
of  each  sequence  exactly  j+l-i  have  the  strong  dominating  property  (by  the 
Cycle  Lemma),  hence  the  number  of  legal  sequences,  or  equivalently  the  number 
of  legal  lattice-paths,  is  given  by 

(*) 


■<i.«-»(T)  ■*•(?> 


For  i=j=n  we  get  the  number  of  legal  paths  from  (0,0)  to  (n,n),  namely  the 
Catalan  numbers  (see  p.  9).   Figure  2.19  demonstrates  these  numbers. 


1 

6 

20 

48 

90 

132 

132 

5 

14 

28 

42 

42 

0 

4 

9 

14 

14 

0 

0 

3 

5 

5 

0 

0 

0 

2 

2 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

i      ► 

The  lattice  numbers  a(i,j) 
Figure  2.19 
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This  counting  can  also  be  done  using  the  reflection  principle  (see, 
for  example,  [Yaglom  and  Yaglom,  1964,  problem  83]  or  [Gnedenko,  1962,  p.  36]), 
from  which  it  follows  that 


•<u>-C;JK-08 


the  first  term  stands  for  the  total  number  of  paths  from  (0,0)  to  (i,j),  and 
the  second  term  stands  for  those  paths  that  do  pass  below  the  diagonal,  and 
one  can  show  that  these  forbidden  paths  are  in  a  one-to-one  correspondence  with 
the  paths  from  (1,-1)  to  (i,j)  (using  a  reflection  with  respect  to  y=x-l). 

It  is  clear  that  the  sequence  a(i,j)  can  also  be  defined  recursively 
as  follows: 

(   1  for  i  =  0,  j  >_  0 

a(i,j)  =   <  0  for  j  <  i      (**) 

^a(i,j-l)  +  a(i-l,j)  otherwise. 
Figure  2.19  demonstrates  this  fact. 

It  is  also  clear  that  the  number  of  legal  paths  from  (i,j)  to  (n,n) 
is  equal  to  the  number  of  those  from  (0,0)  to  (n-j,n-j),  and  is  given  by 

afn_i  n_i)  _  j-i  +  1    ftn-i-j+A   j-i+1  /2n-i-j\        (ieieie) 
a(nj,ni)   2n-i-j+l  V   "-j  /   n-i+1  \  n-j  /  "       l  ' 

These  numbers  will  play  a  key  role  in  the  ranking  algorithm  in  the 
next  chapter.  We  summarize  the  above  discussion  in  the  following  lemma: 
Lattice  Lemma: 

(1)  a(i,j),  the  number  of  legal  paths  from  (0,0)  to  (i,j),  satisfies  (**) 

(2)  A  closed-form  expression  for  a(i,j)  is  given  in  (*). 

(3)  The  number  of  legal  paths  from  (i,j)  to  (n,n)  is  given  in  (***). 
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2.6  ORDERED  TREES  WITH  N  EDGES 

We  present  now  several  statistical  results  about  the  class  T  of 

n 

ordered  trees  with  n  edges;  the  discussion  follows  [Dershowitz  and  Zaks,  1979] 

Theorem  2.1 :  The  number  R  (r)  of  trees  in  T  with  root  of  degree  r  is 
n  n  J 

".w-S-ft-T1)- 

Proof:  If  a  tree  T  has  a  root  of  degree  r,  then  the  corresponding  ix     lattice- 
path  (see  p.  18)  starts  with  (0,0)  ->  (0,r)  -*  (l,r)  .  But  the  number  of  legal 
path  from  (l,r)  to  (n,n)  is 


r-1+1  /2n-r-A  _  r  /2n-r-l\ 
n-1+1  '  \n-r  )  "   n  \  n-1  / 


by  the  Lattice  Lemma.  O 

Corollary:  The  expected  root  degree  of  trees  in  T  is    —pr  . 

(Note  that  this  number  is  less  than  3  for  any  n  .  ) 

Proof:  This  expected  number  is  given  by 

n  n   2  /2n-r-l\ 

E  r«R  (r)      1  •  E  r  »l  n-1  ) 

r=0    n     =   n  r=0     V     7 

IT  I  C 

1  n1  n 


But     z  r2-  (2nl^"1)=     -nT2*(n+l)      ;     this  follows  from  the  identity 

)<L  V  P  )\    m  )  '"   (p  +  m  +  ij, 


r=0 

0<k<s 


for  m^t^O,   p,  s^O   (see  [Knuth,  1968,  p.  58]).  Now,  Cn  =  ^t\2[])»  and 
the  result  follows  .  O 

For  this  theorem  and  corollary  see  also  [Ruskey  and  Hu,  1977]. 


Theorem  2.2:  The  total  number  D  (d)  of  nodes  of  degree  d  in  trees  in  T  is 
n  3  n 
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Proof:  By  lemma  2.1(4)  a  node  of  degree  d  corresponds  to  a  run  of  exactly  d 
O's  in  the  x  sequence;  but  if  we  add  a  1  in  front  of  such  a  sequence  we  get  a 
sequence  which  has  the  strong  dominating  property.  Therefore,  by  the  Cycle 
Lemma,  we  have  to  count  the  number  of  occurrences  of  the  pattern  10  1  in  all 
the  distinct  cyclic  permutations  of  n+1  l's  and  n  O's,  and  this  number  is 
given  by 


/n-l+n-d\    /2n- 
[    n-1  ;■  In. 


d-1 
1 


Note  that  if  a  permutation  contains  p  occurrences  of  the  pattern  10  1  ,  it 
will  be  counted  p  times,  as  desired.  Q 

Theorem  2.3:  The  number  L  (k)  of  trees  in  T  that  have  exactly  k  leaves  is 

n  n 

Ln(k)  =  n  VkXk-lj  =  k  Ak-l/\k-lJ  =  nTT\k-lj\  k  )  " 

This  result  has  been  previously  proved  in  [Narayana,  1959]  in 
connection  with  partial  order  on  partitions;  we  present  here  two  new  proofs. 
We > need  the  following  lemma: 

Lemma  2.2:  The  numbers  g.  (x,y,z)  and  gr(x,y,z)  of  ways  to  arrange  x  l's 
and  y  O's  on  a  line  and  on  a  circle,  respectively,  with  exactly  z 
occurrences  of  the  pattern  10,  are  given  by 

9L(x,y,z)  -  (*}(*)  and   ^(x.y.z)  -  \^{t})  ■ 

Proof:  Put  the  10's  first;  the  rest  of  the  O's  and  l's  must  be  put  between 
these  10  's  in  only  one  way:  an  arbitrary  number  of  O's  followed  by  an  arbitrary 
number  of  l's.  The  rest  follows  immediately  using  ordinary  techniques.  O 
First  proof  of  theorem  2.3:  From  the  Cycle  lemma  and  lemma  2.1  it  follows  that 
L  (k)  is  equal  to  the  number  of  way  to  arrange  n+1  l's  and  n  O's  on  a  circle, 
with  exactly  k  occurrences  of  the  pattern  10,  hence  L  (k)  =  gr(n+l,n,k),  and 
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the  result  follows.  /~\ 

Second  proof  of  theorem  2.3:  We  have  to  count  the  number  of  sequences  with  n 
l's,  n  0's  and  exactly  k  10's,  which  also  have  the  dominating  property  (p.  13), 
Without  this  property  we  have  g. (n,n,k)  such  sequences  by  lemma  2.2.  Using 
lattice-path  techniques  it  can  be  shown  (see  [Zaks  and  Dershowitz,  1979])  that 
the  number  of  illegal  paths  is  equal  to  the  number  of  paths  from  (0,0)  to 
(n+l,n-l)  with  k  corners  (see  p.  19),  and  is  therefore  given  by  g. (n+1 ,n-l  ,k); 
hence  L  (k)  =  g,(n,n,k)  -  g.  (n+1  ,n-l  ,k),  and  the  result  follows.    (3 

Corollary  1 :  The  number  of  trees  in  T   with  k  leaves  is  equal  to  the  number  of 

those  with  n+l-k  leaves. 

This  simply  means  that  L  (k)  =  L  (n+l-k).  A  direct  proof  for  this 
r  n      n 

corollary  follows: 

(We  use  notations  from  section  2.3.)   Let  T  e  B  .   If  the  tree  t(T) 

in  T   has  k  leaves,  then  the  corresponding  0,1  sequence  x(T)  has  exactly  k  10's 

and  in  T  exactly  k  leaves  are  left  leaves  (lemma  2.1(1)).  But  if  we  traverse  T 

in  a  right  preorder  (root-right-left),  we  get  a  sequence  in  X   with  n+l-k  10's 

(since  now  the  right  leaves  will  contribute  the  10  patterns,  and  there  are 

exactly  n+l-k  such  leaves),  which  corresponds  to  a  tree  in  T  with  n+l-k  leaves. 

This  correspondence  between  trees  in  T   which  have  k  leaves  and  those  with 

n 

n+l-k  leaves  is  clearly  one-to-one.  /~\ 

n+1 

Corollary  2:  The  average  number  of  leaves  in  trees  from  T   is  —*r-  . 

This  follows  immediately  from  corollary  1.  A  direct  proof  for  this 
corollary  follows : 

Following  lejurar  £.1(1 ) ,  we  count  the  number  of  corners  in  the  lattice- 
paths  in  L  (which  is  equal  to  the  number  of  leaves  in  the  trees  in  T  ). 
n  n 
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Let  h(i,j)  denote  the  number  of  paths  in  L   in  which  the  lattice  point  (i,j) 
is  a  corner.  Clearly,  h(i,j)  is  the  number  of  legal  paths  from  (0,0)  to  (i,j-l) 
times  the  number  of  legal  paths  from  (i+l,j)  to  (n,n).  By  the  Lattice  Lemma 
we  have,  therefore, 

H(iJ)-[(T)-(1ft,)HR=l=H-(^-i-',)] 

for  j  >_  i .  Therefore  the  total  number  of  corners,  N,  is  given  by 

n-1  n 
z  z 
i=l  j=0 

(the  first  term  stands  for  the  number  of  corners  on  the  first  column  x=0;  each 


l[^(2n)+  "j,  j0^n 


path  in  L       has  one  such  corner  ).    By  partitioning  h(i,j)   into  four  parts   (by 
opening  the  brackets),  and  for  each  of  these  parts  using  the  identity 

/s+k\/r-k\  =     /r+s+l\ 


kf0  U/Ur  Ui|-  n-s-°-  m'r-0, 

(see  [Knuth,  1968,  p.  58]),  it  follows  that  N  =  jf").  Dividing  this  total 
number  of  corners  by  the  total  number  of  paths  in  L   (  —pr[   n )  )  we  get  the 


average  number  ^—    of  corners 


O 


We  conclude  with  an  interesting  counting  result,  the  discussion 
of  which  is  the  main  part  of  [Dershowitz  and  Zaks,  1979]: 
Theorem  2.4:   The  number  N  U,d)  of  nodes  of  degree  d  which  reside  on  level  i 


in  trees  in  T   is 
n 


V*"a;    2n-d  \  n+£/  . 
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2.7  BIBLIOGRAPHICAL  NOTES 

Many  articles  have  been  written  about  these  trees,  sequences,  lattice 
paths  and  other  related  combinatorial  structures,  going  back  to  Leonhard  Euler; 
we  deeply  apologize  for  not  mentioning  all  of  them.  An  exhaustive  list  of  454 
references  has  been  compiled  in  [Gould,  1977]. 

An  extensive  discussion  about  ordered  trees  is  found  in  [Knuth,  1968]. 
General  references  for  the  combinatorial  topics  are  [Hall,  1967]  and  [Liu,  1968] 

The  book  of  integer  sequences  [Sloane,  1973]  was  used  quite 
a  lot  in  our  study  [Dershowitz  and  Zaks,  1979],  part  of  which  is  discussed  in 
section  2.6. 

Two  general  techniques  are  used  many  times  in  studies  of  lattice-paths 
and  integer  sequences;  they  are  the  reflection  principle  and  the  Cycle  Lemma. 
For  the  reflection  principle  we  gave  two  references  in  p.  27  .  The  Cycle  Lemma 
goes  back  to  [Motzkin,  1948],  which  discusses  also  generalized  ballot  problems. 
Discussions  about  this  lemma  and  various  generalizations  are  found  in  [Raney, 
1960]  and  [  Taka'cs,  1967,  chapter  1].  [Bergman,  1978],  [Sands,  <*978]  and 
[Singmaster,  1978]  are  three  recent  interesting  short  articles  with  new  proofs 
to  the  lemma. 

In  manipulating  combinatorial  summations  we  found  [Knuth,  1968] 
yery   helpful  (see  p.  28  and  p.  31  ,  and  also  p.  63). 

We  mention  a  few  more  references  that  discuss  trees,  sequences  and/or 

lattice-paths:  [Etherinqton,  1938],  [Lyness,  1941],  [Erdos  and  Kaplansky,  1946], 
[DeBruijn  and  Morselt,  1967],  [Silberger,  1969],  [Klarner,  1969  and  1970], 
[Read,  1972]  and  [Chorneyko  and  Mohanty,  1975]. 

The 'fun  with  lattice-paths'  in  [Grossman,  1946  and  1950]  and  the 
description  in  [Gardner,  1976]  are  yery   entertaining. 
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[Carl i tz,  1969a]  first  gave  the  closed-form  expression  for  t(k,n), 
that  was  mentioned  earlier  (recursively,  without  a  closed-form)  in  [Riordan, 
1957]. 

A  few  references  for  the  ballot  problem  are  [Whitworth,  1878], 
[Dvoretzky  and  Motzkin,  1947],  [Mohanty  and  Narayana,  1961]  and  [Carlitz,  1969b] 
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CHAPTER  3 
GENERATING  TREES:   PART   I 


3.1   INTRODUCTION 


We  explained  previously  why  one  would  like  to  study  generating, 

ranking  and  unranking  algorithms  for  certain  classes  of  trees  (see  sections 

1.1  and  2.1).  We  discuss  these  algorithms  in  the  next  two  chapters.  In 

this  chapter  we  use  the  correspondences  between  the  trees  in  B  ,  the  integer 

sequences  in  X„  and  Z  ,  and  the  lattice-paths  in  L  ,  all  of  which  were 
n         n      n  r       n 

presented  in  section  2.3.  Using  the  sequences  in  Z  ,  we  develop  an 
algorithm  that  generates  the  trees  in  B  in  order,  we  show  how  to  find  the 
position  of  a  given  tree,  and  we  show  how  to  find  a  tree  given  its  position. 
The  linear  ordering  of  trees  used  throughout  this  chapter  is  the  a-order. 

We  would  like  to  note  that,  using  these  generating,  ranking  and 
unranking  algorithms  for  sequences,  one  can  convert  them  into  algorithms 
that  will  directly  operate  on  the  trees.  This  would  not  add  but  several 
programming  details  to  our  discussion,  and  we  prefered  not  to  include  this 
here. 

We  also  show  (theorem  3.1)  that  our  order  of  generating  these  trees 
(the  a-order)  is  the  same  order  in  which  they  are  generated  by  two  existing 
algorithms. 

The  generating  algorithm  is  generalized  to  k-ary  trees.  The  average 

number  of  comparisons  in  generating  the  sequences  corresponding  to  the  k-ary 

trees  with  n  internal  nodes  is  shown  to  be  less  than  two  (independent  of  n,  or 

even  of  k).  Fixing  k  and  increasing  n,this  number  approaches  (1--* 1 )   , 

kK 
which   is   4/3   f°r  binary  trees,  and  is  smaller  than  1.1  for  k>4. 
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Section  3.2  discusses  the  connection  between  our  results  and  other 
known  results  on  this  subject.  In  section  3.3  the  generating  algorithm  is 
discussed.  Ranking  and  unranking  algorithms  are  the  subjects  of  sections  3.4 
and  3.5,  respectively.  Extension  of  the  generating  algorithm  from  binary  to 
k-ary  trees  can  be  found  in  section  3.6,  and  its  analysis  is  the  subject  of 
section  3.7.  Section  3.8  presents,  as  an  example,  the  14  trees  in  B,,  Bi  and 
T-,  together  with  their  various  corresponding  sequences. 

3.2  RELATIONS  TO  EXISTING  ALGORITHMS 

Algorithms  for  generating  binary  trees  and  k-ary  trees  have  been 
extensively  studied  in  the  past  few  years.  One  approach  uses  permutations 
(see  pp.  20-21).  In  [Trojanowski ,  1977a]  the  generating,  ranking  and  unranking 
steps  are  discussed,  using  the  a-order  (see  theorem  3.1).  [Knott,  1977] 
discusses  the  ranking  and  unranking  algorithms  using  the  a'-order.  This  order 
is  also  used  in  [Trojanowski,  1977a]  for  ranking  and  unranking  in  the  binary 
case.  See  also  [Rotem  and  Varol ,  1977]  for  a  related  work.  A  second  approach 
is  to  use  level  numbers  (see  pp.  20-21).  Generating,  ranking  and  unranking  in 
the  binary  case  are  discussed  in  [Ruskey  and  Hu,  1977]  and  are  extended  to 
k-ary  trees  in  {Ruskey,  1978],  both  using  the  a-order  for  trees  (see  theorems 
3.1  and  3.5).  [Col bourn,  1977]  contains  many  references  and  a  brief  summary 
on  the  subject  of  graph  generation. 

Our  approach  for  generating  trees  makes  use  of  the  integer  sequences 
introduced  before  (section  2.3).  More  precisely,  we  use  the  x  and  z  sequences 
for  generating,  ranking  and  unranking  binary  trees  and,  more  generally,  k-ary 
trees.  Treating  these  classes  of  trees  using  these  integer  sequences  seems  to 
be  the  simplest  and  most  intuitive  approach  to  deal  with  problems  of  the  kind 
discussed  here. 
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The  following  theorem  establishes  the  relations  mentioned  above: 
Theorem  3.1 :  Let  T,  T1  e  B  .  The  following  are  equivalent: 

(1)  T  <  T'  . 

a 

(2)  x(T)  <  x(T')  (see  p.  14). 

(3)  z(T)  >  z(T')  (see  p.  16). 

(4)  T  <  T'  according  to  Ruskey-Hu's  level  sequence  (see  pp.  20-21). 

(5)  T  <  T'  according  to  Trojanowski 's  permutation  (see  pp.  20-21). 
Proof:  (1)  -*-+■  (2)  follows  from  the  Ordering  Lemma. 

(2)  ■*-»■  (3)  follows  immediately  from  the  definitions  of  the  z  sequences. 

(2)  ~  (4) 
x  <  x'  iff  for  some  l,   l<£<2n,  xi  =  x'.   for  i  =  1 ,2,  . . .  ,  £-1 ,  and  x£  =  0<1  = 
x  .  Let  us  look  at  those  parts  of  T  and  T'  corresponding  to  the  first  I  -   1 
nodes  traversed  in  preorder.  Those  parts  must  be  isomorphic,  otherwise  we 
could  not  have  (x-j ,  . . . ,  x  |)=(x'  , . . .  >x^_-i),  by  the  equivalence  (1)**(2).  Hence,  x<x 
iff  all  the  leaves  among  the  first  i   -  1  nodes  (in  preorder  traversing)  of 
T  and  T1  are  in  the  same  levels,  correspondingly,  but  the  leaf  now  scanned  in 
T  (corresponding  to  x  =  0)  is  in  a  lower  level  than  that  of  the  leaf  that 
will  be  next  scanned  in  T1  (this  leaf  will  be  a  successor  of  the  internal  node 
corresponding  to  x!  =  1);  this  happens  iff  the  level  sequence  of  T  is  less 
than  that  of  T1  in  Ruskey-Hu's  ordering. 

(2)  ~  (5) 

x  <  x1  iff  for  some  a   x.  =  x'.  for  i  =  1 ,  2 £-1  and  x  =  0  <  1  =  x'. 

This  happens  iff  while  scanning  both  T  and  T'  in  preorder  we  meet  internal 
nodes  and  leaves  correspondingly,  but  the  5,-th  scanned  node  is  a  leaf  b  in 
T  and  an  internal  node  b  in  T' .   This  happens  iff  while  reading  the  pre- 
order labeling  of  the  internal  vertices  of  T  and  T'  in  inorder,  the  two 
corresponding  permutations  p  and  p'  coincide  up  to  this  point,  and  now 
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in  p  we  add  the  label  of  b's  father  as  the  next  element,  while  in  p'  we 
add  a  number  which  is  at  least  as  big  as  the  label  of  b;  but  the  labels 
of  b's  father  and  b's  father  are  the  same.  This  happens  iff  p  <  p'.    /"\ 

Corollary:  In  order  to  generate  the  trees  in  B   in  a-order,  it  is  sufficient 

to  generate  the  sequences  in  X  lexicographically  or  the  sequences  in  Z 

an ti lexicographically. 

Note:  It  should  be  clear  how  the  conversion  of  a  sequence  into  its 

corresponding  tree  is  made,  and  this  is  not  discussed  here. 

It  is  of  interest  to  compare  the  feasibility  condition  in  [Ruskey 
and  Hu,  1977]  with  the  following: 

Theorem  3.2:  For  a  0,1  sequence  x  =  {*•}?  the  following  are  equivalent: 
(1)  x  e  Xn- 
{?.)     Replacing  any  (or  the  leftmost)  100  pattern  in  x  by  0,  as  long  as 

possible,  results  in  a  sequence  (lO)*3  for  some  p,  1  <_  p  <_n. 
(3)  Erasing  any  10  pattern  in  x,  as  long  as  possible,  results  in  the 
empty  sequence. 

Pl°°_t:   Follows  immediately  from  the  definitions. 

3.3  THE  GENERATING  ALGORITHM 

We  present  now  an  algorithm  that  generates  the  sequences  in  Z 

lexicographically.  Recall  that  corresponding  to  the  regular  binary  trees 

with  n  internal  nodes  we  have  the  inteqer  sequences  z  ={z.}?e  Z   in 

3    M  l  1    n 

which  0  <  z-|  <  z2  <...<  zn  and  z.  1  2i-l  for  each  i.  Generating  these 
sequences  in  order  results  in  generating  the  corresponding  trees  in  a-order 
from  the  last  tree  to  the  first  one  (see  theorem  3.1).  If  one  prefers,  he 
can  easily  convert  it  into  an  algorithm  that  will  generate  these  sequences 

antilexicographically  (starting  with  z=l ,3 2n-l  and  ending  with  z=l , 

2,...,n  ). 
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In  generating  sequences  lexicographically  the  obvious  technique  is 
usually  used:  Given  a  sequence  u  =  u-jUp.-.u  ,  to  find  the  next  sequence 
we  scan  u  from  the  right  until  we  find  the  first  u.  that  can  be  incremented; 
then  this  u.  is  incremented,  and  to  its  right  we  append  the  first  possible 
subsequence.  In  the  following  algorithm  the  current  sequence  z  =  z,z«...z 
is  scanned  from  the  right  in  step  2.  If  z  =  l,3,...,2n-l  (j=0;  last 
sequence)  then  we  stop.  (For  that  reason  we  introduced  the  sentinel  zn=0  .) 
Otherwise,  we  find  the  rightmost zn-  such  that  z.<  2j-l,  increment  it  by  1, 
and  append  to  its  right  the  first  possible  subsequence   z  .+1 ,  z.+2,  ...  , 

J        J 

z.+n-j.  The  algorithm  starts  with  the  first  sequence  z  =  l,2,...,n,  and 
proceeds  from  a  sequence  to  its  successor  until  it  reaches  the  last  sequence 
z  =  1,3,  ...  ,2n-l  ,  as  follows: 


Al 


gorithm  GENERATE.BINARY  (generating  Z  lexicographically,  given  n) 

1 .  (first  sequence) 

for  i  *-  0  to  n  do  z.  +-  i  ; 
(z0  is  a  sentinel ) 

2.  (scanning  the  current  sequence  z={z.}) 
J  «-  n  ; 

while  z.  =  2j-l  d£  j  «-  j-1  ; 

(j  points  now  at  the  rightmost  index  i  for  which  z.  <  2i-l) 

if  j=0  then  stop  (last  sequence) 

3.  (next  sequence) 

(z.  unchanged  for  i  <  j) 

zo  -  zj+1  ; 

(changing  the  entries  to  the  right  of  z.) 

•J 

if.  j  <  n  then  for  i  «-  j+1  to  n  do  z.  «-  z.  -|+1  ; 

4.  (going  to  generate  the  next  sequence,  if  possible) 
goto  2. 
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For  example,  the  14   sequences  in  Z.,  corresponding  to  the  regular 

binary  trees  with  4  internal  nodes,  B,,  are  generated  by  the  algorithm 

GENERATE_BINARY  as  follows  (we  undelined  the  rightmost  z.  (see  discussion); 
also,  zQ=0  is  omitted): 


Position 

Sequence 

1 

1234 

2 

1235 

3 

1236 

4 

1237 

5 

1245 

6 

1246 

7 

1247 

8 

1256 

9 

1257 

10 

1345 

11 

1346 

12 

1347 

13 

1356 

14 

1357 

For  more  details  on  this  example  see  section  3.8 


3.4  THE  RANKING  ALGORITHM 

We  now  show  how  to  compute  the  function  INDEX(z)     that  maps  a 
sequence  in  Z   to  its  correspondence  position  (in  the  lexicographic  ordering 
of  Z  ).  It  follows  that,  given  a  tree  T  e  B  ,  its  position  INDEX(I)   may 
be  found  by  INDEX{T)   =  C  -  index{z)   +1.  (We  use  the  same  function  name 
INDEX   for  the  function  that  maps  an  element  to  its  position  in  a  linear 
ordered  set  .  ) 

We  will  make  use  of  the  numbers  a(i,j)  (see  the  Lattice  Lemma); 
more  precisely,  we  will  need  the  numbers  b(n,t)  of  legal  paths  from  (0,0) 
to  (n,n)  that  start  in  more  than  t  steps  up  (see  figure  3.1). 
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(O.t+1) 


(0,0) 


Paths  from  (0,t+1)  to  (n,n) 
Figure  3.1 

By  the  Lattice  Lemma  this  number  b(n,t)  is  given  by 

hfn  +\   a^n  +  l  n)   t+2  /  2n-t\  _  t+2  /2n-t-l 
b(n,t)  -  a(n-t-l.n)  -  j^t^-t^)  -  ^  n 

Note  that  the  b(i,j)'s  satisfy  the  recurrence  relation 


b(1,j)  = 


C.  = 


1  /2i 


i    1+1  V  1 


for  j=i-l ,  i  >_  1 
for  j=0,  11 


b(i,j+l)  +  b(i-l.j-l)  otherwise 


See  figure  3.2,  and  compare  it  with  the  corresponding  numbers  a(i,j) 
in  p.  26  . 
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i  \ 

0 

1 

2  1 

1 

3 

4 

5 

6 

7  1 

1 

1 

2 

2 

1 

3 

5 

3 

1 

4 

14 

9 

4 

1 

5 

42 

28 

14 

5 

1 

6 

132 

90 

48 

20 

6 

1 

7 

429 

297 

165 

75 

27 

7 

1 

8 

1430 

1001 

572 

275 

no 

35 

8 

1 

The  sequence  b(i  ,j) 
Figure  3.2 

Let  p=lNIT{z)   denote  the  largest  i  for  which  z.  =  i  (note  that  we 
always  have  z,=l,  hence  INIT{z)   >_1);  this  means  that  the  corresponding 

n  1 

lattice-path  s.'     (see  p.  18)  starts  with  p  steps  up.  Let  z  =  {z\ }  ^   be 
the  sequence  obtained  from  z  by  deleting  z   and  setting  z.  «•  z.+-|  -  2 
for  j  >  p  (and  z,  +   z.   for  j  <  p)  ;  this  corresponds  to  'cutting' 
the  first  corner  in  the  corresponding  lattice-path  and  concatenating  the 
two  pieces.   The  following  is  the  key  to  the  ranking  algorithm: 
Theorem  3.3:  lNDEX{z)   can  be  computed  recursively  by 

f  1  if  p  =  n 


INDEX{z).    = 


b(n,p)  +  lNDEX{z)         if  p  f   n 


Proof:  By  step  1  of  the  algorithm  GENERATE-BINARY  it  follows  that 
index{z)  =   1  when  p  =  n.  For  any  other  sequence  z  we  have 

lNDEX[z)   =   [  the  number  of  sequences  z  with  init{z)  >_  p+1  ] 

+  [  the  position  of  z  among  all  the  sequences  z  with  init(z)   =fl 
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by  the  nature  of  the  lexicographic  ordering.  We  show  that  the  first  term  in 
this  sum  is  equal  to  b(n,p)  and  that  the  second  one  is  INDEX (z) . 

That  the  first  term  is  equal  to  b(n,p)  follows  from  the  fact  that  ch 
this  number  is  equal  to  the  number  of  legal  paths  that  start  with  more  than 
p  steps  up,  which  is  precisely  b(n,p).  As  for  the  second  term,  all  the 
sequences  z  for  which  init(z)   =  p  correspond  to  0,1  sequences  x  =  l^Oy. 
The  sequence  z  as  described  above  is  exactly  the  one  corresponding  to 
x  =  lp~  y;  hence  the  position  of  z  among  all  the  sequences  z  with  JNiT{z)=p 
is  exactly  INDEX(z).   (For  the  last  part  see  also  theorem  4.1  .) 


o 


Example:  For  the  tree  in  figure  3.3  we  have 


T  : 


x(T)  =  1111000101100100 
z(T)  =  1  2  3  4  8  10  11  14 


A  tree  in  B 


6 


Figure  3.3 
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IM?£J(1,2,3,4,8,10,11,14)  =  b(8,4)  +  INDEX{1 ,2,3,6,8,9,12) 
INDEX{1 ,2,3,6,8,9,12)  =  b(7,3)  +  INDEXtf  ,2,4,6,7,10) 


17^*0,2,4,6,7,10) 
rM?£*(l,2,4,5,8) 
IM?£*0  .2,3,6) 
IM?£*0»2,4) 
INDEXO  ,2) 


=  b(6,2)  +  IM?£*0  .2,4,5,8) 

=  b(5,2)  +  INDEX[1, 2 ,3,6) 

=  b(4,3)  +  INDEXC\,2,4) 

=  b(3,2)  +  INDEXC\,2) 
=   1 


SO 


INDEX(1,2, 3, 4, 8, 10, 11, 14)   =  b(8,4)  +  b(7,3)  +  b(6,2)  +  b(5,2)  + 

b(4,3)  +  b(3,2)  +  1 

=  110  +  75  +  48  +  14  +  1  +  1  +  1  =  250  . 
Therefore  INDEX{T)   =  Cg  -  250  +1  =  1181  . 

In  figure  3.4  the  lattice-path  i{J)   is  shown.  Each  lattice  point 
is  labeled  with  the  number  of  legal  paths  from  it  to  (n,n)  ((8,8)  in  this 
case). 

Note:  The  numbers  used  in  computing  index(z)   are  exactly  those  directly 
above  the  corresponding  path,  plus  an  additional  1  (to  make  the  counting  start 
at  1  rather  than  0).  In  this  example  they  are  the  numbers  110,  75,  48,  14,  1 
and  1.  This  observation  is  due  to  D.  Richards  and  has  been  further  extended 
in  [Zaks  and  Richards,  1977]  (see  discussion  in  the  next  chapter,  sections  4.4 
and  4.5). 
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90 

42 

14 
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42 
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1430 

429 

1430 

■i 



*- 

An  example  for  the  ranking  function 
Figure  3.4 
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3.5  THE  UNRANKING  ALGORITHM 

Given  a  number  i,  1  <.  i  <_  C  ,  we  show  in  this  section  how  to  find 
the  i-th  tree  in  B  .  As  explained,  it  suffices  to  do  this  for  the  sequences 
in  ZR  (corollary,  p.  37). 

The  algorithm  that  we  present  follows  immediately  from  the  way  we 
calculated  index{z)   using  theorem  3.3,  so  we  will  skip  its  proof  here.  We 
first  illustrate  the  algorithm  by  the  tree  from  the  previous  example: 

Suppose  we  want  to  find  the  1181-st  tree  T  in  Bft.  By  the  previous 
discussion,  we  have  to  find  the  sequence  z  in  Zfi  such  that  INDEX(z)   = 
Cg  -  INDEX(J)   +  1  =  250  (in  other  words,  we  have  to  find  the  250-th 
sequence  in  Zq).  By  theorem  3.3  we  know  that 

250  =  b(8,p)  +  INDEX{z). 


We  choose  p  such  that   b(8,p)  <  250  <_  b(8,p-l);  here  p=4,  and  we  get 
250  =  110  +  lNDEX{z),   or  INDEX{z)  =   90,  where  z  is  a  sequence  in  1-,. 
Next  we  get  90  =  b(7,3)  +  INDEX(z),   or  INDEX{1)  =   15,  and  so  on. 
At  the  end  we  have 

250  =  b(8,4)  +  b(7,3)  +  b(6,2)  +  b(5,2)  +  b(4,3)  +  b(3,2)  +  b(2,l)  . 
From  this  decomposition  of  250  we  reconstruct  the  sequence  z: 

The  last  1  tells  us  that  at  the  \/ery   end  (of  computing  INDEX(z)) 
we  had  {1,21.  Then  b(3,2)  tells  us  that  a  step  before  we  had  the  sequence 
{l,2,y},  and  that  after  omitting  the  2  and  setting  y  ■*-   y-2  we  got  {1,2}. 
This  means  that  we  have  {1,2,4}.  Continuing  in  this  manner,  we  get: 


b(4,3) 
b(5,2) 
b(6,2) 
b(7,3) 
b(8,4) 


1,2,3,6} 

1,2,4,5,8} 
1,2,4,6,7,10} 
1,2,3,6,8,9,12} 
1,2,3,4,8,10,11,14}  =  z. 
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Using  the  geometric  interpretation  shown     above  (pp.  43-44),  it 
should  be  clear  intuitively  how  the  unranking  algorithm  works;     tracing  the 
corresponding  lattice  points,  following  this  unranking  algorithm,  will  make 
this  point  clearer.     We  now  turn  to  a  formal  description  of  this  algorithm. 

The  following  algorithm  converts  a  number  i   to  the  i-th  sequence 
in  Z  ,     given  i   and  n: 
Algorithm  UNRANK.BINARY 

1.  A  «-  1;  J  «-  n; 

2.  find  m.  for  which  b(j,m.)  <  A<_b(j,m.-1)  ; 

3.  A  -«-  i  -  b(j,m.)  ; 

J  +  j-l  ; 

vf  A  >  0  then  goto  2  ; 

n 
(we  now  have  the  decomposition  i  =   z  b(j,m.)  ) 

J 


4.  z  <-  {1,2,  ...  ,  jQ}  ; 

s-mq   ; 

5.  (changing  z) 

(z.  unchanged  for  i  <  s) 

zi+1  *■  z^  +  2  for  i  =  q,q-l,...,s  ; 

zs  -  s  ; 

6.  q  •*-   q+1  ; 

rf_  q  <_  n  then  s  «-  m  and  goto  5 . 


J=J0 
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3.6  GENERATING  K-ARY  TREES 

We  generalize  now  the  generating  algorithm  to  the  class  T(k,n)  of 
regular  k-ary  trees  with  n  internal  nodes.  The  proofs  in  this  section  are 
omitted,  being  similar  to  the  corresponding  previous  ones. 

Given  a  tree  T  e  T(k,n),  we  define  the  sequences  x(T)  and  z(T)  in 
a  manner  similar  to  the  binary  case;  figure  3.5  illustrates  these  sequences. 
A  sequence  related  to  x(T)  is  associated  with  a  planted  planar  tree  in  [Klarner, 
1970]  .  > 


0    0    0   0 


0   0 


x(T)  =  11000001000100100000 
z(T)  =  1  2  8  12  15 


Labeling  a  k-ary  tree 
Figure  3.5 
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kn 
Let  a  =  {a-},   be  a  sequence  of  k  l's  and  (k-l)n  O's;  it  has  the 

k-dominating  property  if  in  each  prefix  {a-}?   the  number  of  l's  is  not 

smaller  than  £•  .   In  other  words,  if  this  prefix  contains  s  l's,  then  it 

contains  at  most  (k-l)s  O's.  Note  that  for  k=2  we  get  the  dominating 

property  used  before. 

The  following  thsorem  makes  the  desired  connection  between  k-ary 

trees  and  the  corresponding  sequences  and  lattice-paths: 

Theorem  3.4:  The  following  sets  are  in  one-to-one  correspondence  with  one 

another: 

1.  T(k,n),  the  regular  k-ary  trees  with  n  internal  nodes, 

2.  X(k,n),  the  0,1  sequences  with  n  l's  and  (k-l)n  O's  that 

have  the  k-dominating  property, 

3.  Z(k,n),  the  integer  sequences  {z-}1?   such  that  0  <  z,   < 

z~<   ...  <  z   and  z.  <_  ki-k+1  for  each  i,  and 

4.  L(k,n),  the  lattice-paths  from  (0,0)  to  ((k-l)n,n)  that  do  not 

pass  below  the  diagonal  (k-l)y=x. 

The  following  generalization  of  theorem  3.1  is  the  basis  to  our 
generating  algorithm.  The  proof  follows  immediately  from  that  of  theorem 
3.1  .  Note  that  in  [Trojanowski ,  1977a  and  1977b]  the  permutation  associated 
with  a  tree  is  not  immediately  extended  from  the  binary  case. 
Theorem  3.5:  Let  T,  T'  e  T(k,n).  The  following  are  equivalent: 

(1)  T  <  T' 

a 

(2)  x(T)  <  x(T')  (see  p.  47) 

(3)  z(T)  >  z(T')  (see  p.  47) 

(4)  T  <  T'  according  to  Ruskey-Hu's  level  sequence  (see  pp.  20-21) 
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Corollary:  In  order  to  generate  the  trees  in  T(k,n)  in  order,  it  suffices  to 

generate  the  sequences  in  X(k,n)  or  Z(k,n)  in  order. 

Notes:  1.  It  should  be  clear  how  to  convert  a  sequence  to  the  corresponding 

tree. 
2.  Theorem  3.2  applies  also  here,  with  100  and  10  replaced  with 

10k  and  10k-1,  respectively. 

Following  the  above  discussion,  we  get: 
Algorithm  GENERATE.K-ARY   (generating  Z(k,n)  lexicographically,  given  k  and  n) 

Exactly  like  algorithm  GENERATE.BINARY  (p.  38)  with  one  slight 
modification:  in  step  2  replace  the  upper  bound  ' 2 j - 1 '  with  'kj-k+1'  . 

For  example,  the  12  sequences  in  Z(3,3),  corresponding  to  the 
regular  ternary  trees  with  3  internal  nodes,  are  generated  by  this  algorithm 
as  follows  (we  underlined  the  rightmost  z.  discussed  before;  zn=0  is  omitted): 


Position 

Sequence 

1 

123 

2 

124 

3 

125 

4 

126 

5 

127 

6 

134 

7 

135 

8 

136 

9 
10 

137 
145 

11 

146 

12 

147 
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3.7  ANALYSIS  OF  THE  GENERATING  ALGORITHM 

We  now  study  the  complexity  of  algorithm  GENERATE_K-ARY.   It  is 
clear  that  the  work  done  by  the  algorithm  is  proportional  to  the  number  of 
comparisons  made  in  its  step  2 (namely,  checking  whether  z.  <  kj-k+1  for 
the  current  j);  therefore  we  study  now  the  number  of  these  comparisons. 

After  we  have  generated  the  last  sequence  (which  is  z  =  1,  1+k, 
l+2k,  ...  ,  l+(n-l)k  )  in  step  3,  we  come  back  to  step  2,  in  which  we  have 
to  scan  this  sequence  from  right  to  its  very   left;  and  then,  having  found 
no  j  >0  for  which  z.  <  kj-k+1,  we  stop;  therefore,  in  its  worst  case, 
the  algorithm  makes  n  comparisons  for  one  sequence. 

As  for  the  average  case,  we  show  that  the  average  number  of 
comparisons  per  sequence  is  less  than  two,  independent  of  n  or  even  of  k. 

Let  p.  denote  the  number  of  sequences  in  which  j  comparisons  are 
made  by  the  algorithm.  The  expected  number  of  comparisons  is  then  given  by 

£  J*P. 


(*) 


j>l    J 
t(k,n) 

Lemma  3.1:  P-j  =  t(k,n)  -  t(k,n-l). 

Note:  For  k  =  2  this  is  simplified  to  -f^^2"'?)   b^  t(2'n)  =  bn  =  "nTf  O  • 

Proof:  p.,  counts  those  sequences  z  ={z.},  in  which  z  <  kn  -  (k-1).  This 
means  that  the  corresponding  x  sequence  ends  with  0  ,  or  that  the  corresponding 
lattice  path  passes  through  C((k-1 )n-k,n) .  Denote  the  number  of  paths  from 
A(0,0)  to  a  point  U,  not  going  beyond  the  line  AB,  by  P(U).  Then  we  have 
(see  figure  3.6): 

P(C)  =  P(D((k-l)(n-l),n))  -  P(E((k-l)(n-l),n-l))  =  P(B)  -  P(E)  = 

t(k,n)  -  t(k,n-l). 
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I 

0         0 

0 

B((k-l)n, 
0         0^ 

n) 

0 

o1 

c 

D 

0 

0 

if 

T 

0 

0 

0 

0 

.'1 

1 

1 

»- 

A(0,0) 


The  path  corresponding  to  the  tree  in  figure  3.5 

Figure  3.6 


Denote 


k  = 


(k-1) 


k-1 


Lemma  3.2: 


'1 


t(k,n) 


->  1  -  k  as  n  ->   °°,  for  any  k  >_  2, 


Note:  This  limit  is  3/4  for  k  =  2. 

!      fk(n-l) 
Proof:  t(k,n-1)_  (k-l)(n-1)+l  '  V  n-1 


t(k,n) 


1 
(k-l)n+l 


(kn") 


(k-l)n  +  1   (k-l)n   (k-l)n  -  1 
(k-l)n  +  2  "   kn   '   kn  -  1 


->  1 


k-1 


k-1 


k  as  n  -> 


(k-1)n  -  (k-2)      1 

kn  -  (k-2)  '  kn  -  (k-1) 


Hence,  by  lemma  3.1  fTkrT)  "*"  ^  "  k  as  n 


O 
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This  means  that  in  most  cases  only  one  comparison  is  made;  for  example, 

this  happens  in  about  75%  of  the  cases  for  k  =  2,  85%  of  the  cases  for  k  =  3 

and  92%  of  the  cases  for  k  =  5. 

Lemma  3.3:  p.  =  t(k,n-j+l)  -  t(k,n-j)  for  l<_j<_n-l,  and  p  =  1,  for 

k  >  2. 

Note:  For  k  =  2  p.  is  simplified  to  ^4^-  („"jf^). 

Proof:  Similar  to  that  of  lemma  3.1.  (3 

Lemma  3.4:  t(j^n)  +   (1-k)  k  j_1  as  n  -  ». 

3 
Note:  This  limit  is  ».  for  k  =  2. 

Proof:  By  lemma  3.2  this  holds  for  j  =  1.  Assume  it  holds  for  j  <  I  we  have 
by  the  previous  lemma: 

p£  =  t(k,n-£+l)  -  t(k,n-0 

p£_1  =  t(k,n-£+2)  -  t(k,n-£+l) 

Hence 

p£    t(k,n-£+l)  -  t(k,n-£)   t(k,(n-l )-£+2)  -  t(k,(n-l )-£+! )  . 
t(k,n)       t(k,n)  t(k,n-l) 

t(k,n-l) 
t(k,n)  * 

But  the  first  term  approached  (1  -  k)k  ~  by  the  induction  hypothesis,  and 
the  second  term  -   k  (see  proof  of  lemma  3.2);  hence 

P*    (1  -  ~k)k*  ,    • 


t(k,n) 

and  the  lemma  is  thus  proved.  /~\ 

We  now  return  to  (*): 

Theorem  3.6:  Algorithm  GENERATE_K-ARY  requires comparisons  per 

1  -  k 
sequence,  for  n  -*■  °°. 
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Note:  This  limit  is  4/3  for  k  =  2. 

Proof'   Following  the  previous  discussion,  we  get 


n       n-1 

+  tT¥7nT 


E  jp,      E   jp. 

3=1  J    JfJ i  +  _n 


t(k,n)     t(k,n) 


n-1  n-1       n-1 

E  jp,  E   p,       E  p. 

j  =  l   J  =  J  =  l   J  +  J=2  J   +  ...  - 

t(k,n)  t(k,n)     t(k,n) 

n-1  n-1  p 


00                             00 

E             E 

(l-k)kJ_1 

a=l       j=a 

,          "**  .          oo 

(1-k)     E 
a=l 

1-k 

E       k3"1    = 

a=l 

1 

1-k 
Note  that 


i  k 


i-N)M-N)  ;    i 


kk     k-1     (k-l)e  ' 


hence  we  can  upperboundO-k)"  by  (1-1 /(k-1 )e) 


o 
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As  an  illustration  of  our  analysis,  consider  the  case  where  k  =  2 

and  n  =  4.   The  algorithm  generates  the  sequences  1234,  1235,  1236,  1237, 

1245,  1246,  1247,  1256,  1257,  1345,  1346,  1347,  1356,  1357.  In  nine  of 

these  sequences  1  comparison  is  made  when  forming  the  next  sequence,  three 

of  them  require  2  comparisons,  one  requires  3  (namely,  1257),  and  the  last 

one  (1357)  requires  4  comparisons.  Hence,  the  algorithm  uses  on  the  average 

1.571  comparisons  per  sequence.  More  numerical  results  are  shown  in  the 

following  table  (the  column  for  "n  -»•  «"  is  the  limiting  value  — -): 

1-k 


3 

4 

5 

6 

7 

n  ■*  oo 

2 

1.600 

1.571 

1.524 

1.485 

1.457 

1.333 

3 

1.333 

1.291 

1.260 

1.241 

1.229 

1.174 

4 

1.227 

1.193 

1.172 

1.160 

1.153 

1.118 

5 

1.171 

1.144 

1.129 

1.120 

1.115 

1.089 

Average  number  of  comparisons  made  by 
algorithm  GENE RAT E_K-ARY 
Figure  3.7 

As  was  pointed  out  to  us  by  [Paterson,  1978],  the  average  case 

analysis  can  also  be  argued  as  follows:  it  is  clear  that  z   is  changed 

n 
t(k,r)  times,  hence  the  total  number  of  changes  is  z  t(k,r).  Using 

r=l 
Stirling's  formula  for  n!  we  get 


t(k,n) 


-n 


|/2Trn(l-l/k)' 
n 


as  n  ■*  oo  ,  from  which  we  get    {  z  t(k,r)>/t(k,n)  +  1+k+k  + 

r=l 


1-k 
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3.8  AN  EXAMPLE 

We  conclude  this  chapter  by  showing  the  14  regular  binary  trees  in 
B.  with  the  corresponding  trees  and  sequences.  For  each  tree  T  in  B.  we 
show  the  following: 

1.  Its  position  with  respect  to  the  a-order  (p.  22). 

2.  Its  position  with  respect  to  the  a'-order  (p.  22). 

3.  Its  position  with  respect  to  the  g-order  (p.  24). 

4.  The  corresponding  tree  T1  in  Bl  (p.  15). 

5.  The  corresponding  tree  t(T)  in  T.  (p.  16). 

6.  The  corresponding  0,1  sequence  x(T)  in  X.  (p.  15). 

7.  The  corresponding  sequence  z(T)  in  1.     (p.  16). 

8.  The  corresponding  sequence  s(T)  (p.  20). 

9.  The  corresponding  level  sequence  of  Ruskey  and  Hu  (p.  21). 
10.  The  corresponding  permutation  (p.  21). 

The  example  is  shown  in  the  following  three  pages. 
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CHAPTER  4 
GENERATING  TREES:   PART  11 

4.1  INTRODUCTION 

In  this  chapter  we  extend  the  results  from  chapter  3.  In  the  first 
half  of  the  chapter  we  study  generating,  ranking  and  unranking  algorithms  for 
k-ary  trees  in  s-order  using  the  x  and  z  sequences.  In  the  second  half  we 
generalize  the  discussion  of  section  3  about  binary  trees  to  the  classes  of 
trees  that  have  n.  nodes  of  degree  k . ,  i  =  l,2,...,t  for  some  t. 

The  first  half  of  the  chapter  is  treated  in  sections  4.2  (generating) 
and  4.3  (ranking  and  unranking),  and  the  second  half  can  be  found  in  sections 
4.4  (generating)  and  4.5  (ranking  and  unranking). 

4.2  K-ARY  TREES  IN  g-ORDER:  GENERATION 

We  use  the  sequences  x(T)  and  z(T)  corresponding  to  a  regular  k-ary 
tree  T  with  n  internal  nodes  (see  p.  47);  recall  that,  according  to  our 
notations,  T  e  T(k,n),  x(T)  e  X(k,n)  and  z(T)  e  Z(k,n).  We  show  how  to 
generate  the  sequences  in  Z(k,n)  in  order  according  to  the  Hebrew 
lexicographic  ordering;  this  corresponds  to  generating  the  sequences  in 
X(k,n)  according  to  the  same  ordering,  and  to  generating  the  trees  in  T(k,n) 
in  3-order  (see  p.  24).  It  is  clear  how  to  generate  the  sequences  in  Z(k,n) 
in  the  Hebrew  lexicographic  ordering;  the  appropriate  modification  of  the 
algorithm  GENERATE  K-ARY  from  the  previous  chapter  is  not  discussed  here  . 
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See  [Zaks,  1977b]  for  more  details  of  this  algorithm.  Note  that  the  reverse 

R  R    R        R 

z   of  the  sequence  z  in  Z(k,n)  have  the  property  that  z,  >  z2  >  ...  >  z  >  0 

p 

and  z  j+i  <_  ki-k+1  for  each  i,  and  that  instead  of  generating  the  sequences 

in  Z(k,n)  Hebrew-lexicographically  one  can  now  generate  these  reverses  of  the 
sequences  in  Z(k,n)  in  the  English  lexicographic  ordering. 

For  example,  the  12  sequences  in  Z(3,3),  corresponding  to  the 
ternary  trees  with  3  internal  nodes,  T(3,3),  are  generated  by  the  algorithm 
as  follows  (we  underlined  the  leftmost  z.  that  is  changed  ): 


Position 

Sequence 

1 

123 

2 

124 

3 

134 

4 

125 

5 

135 

6 

145 

7 

126 

8 

136 

9 

146 

10 

127 

11 

137 

12 

147 

Note  that,  according  to  this  generating  algorithm,  in  order  to  find  the 
sequence  next  to  a  given  one  we  scan  it  from  left  to  right  until  we  find  the 
first  (leftmost)  entry  that  can  be  incremented,  and  after  incrementing  it  we 
append  to  its  left  the  first  possible  subsequence;   Compare  it  with  the 
discussion  in  section  3.3  (p.  38),  and  also  compare  the  above  example  with 
the  one  in  section  3.6. 
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4.3  K-ARY  TREES  IN  g-ORDER:  RANKING  AND  UNRANKING 

We  now  find  the  position  index{x)   of  a  sequence  x  e  X(k,n)  in  the 
Hebrew-lexicographic  ordering  of  X(k,n).  The  modification  to  Z(k,n)  is  left 

to  the  reader.  It  was  shown  before  that  in  a  sequence  x  e  X(k,n)  erasing 

k-1 
10    patterns,  as  long  as  possible,  results  in  the  empty  sequence.  It  1s 

this  property  of  X(k,n)  that  enables  us  to  compute  the  ranking  function  in 

a  way  similar  to  what  we  did  for  binary  trees  in  chapter  3  .  An  extension 

from  a  different  point  of  view  is  used  later,  in  sections  4.4  and  4.5,  for 

a  more  general  case. 

First  we  observe  the  following: 

Theorem  4.1 :  There  is  a  one-to-one  correspondence  between  all  the  sequences 

in  X(k,n)  that  end  with  10p+k-1  and  those  in  X(k.n-l)  that  end  with  0P, 

for  each  possible  p. 

Proof:  Suppose  there  are  u  sequences  in  X(k,n)  that  end  with  10p  "  ; 

call  them  A.=Y.10P  ~  ,  i=l,2,...,u.  By  omitting  the  last  occurrence  of 

k-1  d 

10     from  each  A.  we  get  the  sequences  B.=Y.0K  ,  i=l,2,...,u,  where 

each  of  the  B. 's  is  in  X(k,n-1)  and  ends  with  0P  (see  the  discussion 

preceding  the  theorem  above).  This  correspondence  is  clearly  one-to-one, 

which  completes  the  proof.  /-> 

Note  that  the  argumentation  in  the  last  proof  generalizes  the 
discussion  in  the  end  of  the  proof  of  theorem  3.3  .   Now,  it  is  evident 
That  the  correspondence  mentioned  above  preserves  the  lexicographic  ordering; 
namely,  A.  <  A.  if  and  only  if  B.  <  B..  This  fact  will  be  used  in  the 
next  theorem,  by  which  we  can  recursively  compute  the  ranking  function: 
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Theorem  4.2:   Let  x=ylOp  e  X(k,n).  Then 

1 


INDEX{x)    =  < 


if  x=ln0(k-1)n 


a((k-l)n-p-l  ,n,k)  +  index  (yO^'^  )   otherwise 
where  a((k-l )n-p-l ,n,k)  is  the  number  of  sequences  in  X(k,n)  that  end 
with  0p+1. 

Proof:  Immediate,  by  the  definition  of  the  Hebrew-lexicographic  ordering 
and  theorem  4.1  .  f~\ 

We  now  evaluate  the  numbers  a(i,j,k)  as  defined  in  theorem  4.2  . 
The  lattice-paths  of  L(k,n)  were  introduced  in  theorem  3.4;  using  these 
correspondences  between  trees,  sequences  and  paths  in  T(k,n),  X(k,n)  and 
L(k,n),  respectively,  it  follows  that  a(i,j,k)  is  equal  to  the  number  of 
lattice-paths  from  (0,0)  to  (i,j)  that  do  not  pass  below  x=(k-l)y,  and  is 
therefore  given  by  the  recursive  definition  as  follows: 

'  1  i  =  0 

0  i  >  (k-l)j     (*) 

a(i,j-l,k)  +  a(i-l,j,k)    otherwise 

An  example  is  shown  in  fiqure  4.1,  where  the  lattice-point  (i,j)  is  labeled 
with  the  number  a(i,j,k),  and  we  show  these  numbers  for  k=3  and  j<4. 

1   i     -*fi    9    12 


a(i,j,k)  =  * 


12 


a(i  ,j,k)  for  k=3  and  j<4 
Figure  4.1 
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The  solution  to  this  recurrence  relation  (*)  follows: 
Theorem  4.3:  The  solution  to  the  recurrence  relation  (*)  for  the  a(i,j,k)'s, 
where  i ,  j  >_  0  and  k  >_2>     is  given  by 


.(i.J.k)  ■  (11_1 


\       1 


i-1 

l^Jj  /i+j-i_kt\  1        /kt\ 

ti}     \       j-t  I    (k-l)t+l  V  t  / 


/  b  \ 

i  2  ,  where  s  <  1,  is  taken  to  be  01. 

vt=l  ' 

Proof:  We  prove  by  induction  on  i  and  j.  For  i  =  0  and  any  j,  we  have  a(0,j,k) 
We  assume  the  formula  holds  for  i-1,  and  prove  it  for  i,  as  follows: 
Let  i  =  (k-1  ;x  +  y,  1  <_  y  <_  k  -  1 .  For  j  <_  x,  we  get  i  =  (k-1  )x  +  y  >_  (k-1  )j 
+  y  >  (k-1 ) j  and  in  this  case  the  formula  must  give  us  the  boundary  condition 


0,  and  it  really  does:  as  i  =  (k-l)x  +  y,   and  1  <_  y  <_  k  -  1,  thus 

So  we  get 

11-11 


I'll  -  x 
k-lj   x 


k-1 


J/i+j-l-kt\  1 


t=l 


YX\   _ 


x 

v 


t=l 


j-t    )  (k-i)t+i  \  t  ; 


l+kt\/[i-(k-l)j-l]  +  k(j-t)\  _±_ 
t  )\  j-t        I    l+kt 


\  3   J         \     J  / 


The  last  step  is  done  using  the  identity 

2  /r-tk\ /s-t(n-k)\  _r_ 
k>0\  k  j\     n-k  j    r-tk 


r+s-tn 
n 


for  integer  n  ,  r  f   tk  ,  and  0  <  k  <_  n  (see  [Knuth,  1968,  p.  58]);  setting 
k  <-  t,  r  «-  1,  t  +■  -k,  s  «•  i  -  (k-l)j  -  1  and  n  +   j  we  get  the  first  term 
(1*J)j  minus  the  term  (1+^_  J  (corresponding  to  t=0). 
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So  the  formula  gives 

ad.j.k).^"1)  +(i+j-1)  -  ('JJ)  =0,  as  desired. 

Assuming  it  holds  for  j  -  1 ,  we  continue  as  follows:  if  j  >  x,  we  have 
i  =  (k-l)x  +  y  <  (k-l).i  +  y,  or  i  <_  (k-l)j,  and  we  show  that  a(i,j,k)  satisfies 
a(i,j,k)  =  a(i,j-l,k)  +  a(i-l,j,k).  By  the  induction  hypothesis  we  have 

i-1 


"^■"■rn-^ra^TFiWc?) 


and 


a(l-l.J.k)  =  (1ft2 


i-2 


-k;lj/i+J-2-kt\  1  /ktN 

t:1  [  j-t   j  (k-Dt+i  u;. 


Therefore, 


i-1 


k-1 
a(i.j-l.k)  +  a(i-l,j,k)  =  f1^"1)  -       2" 

v     1    '         t=l 


(...)  + 


i-2 
k-1 

"  2  " 
t=l 


(...) 


It  remains  to  show  that 


i-1 


i-2 


k-1 


Lk;lj/i+J-2-kt\  1  (kt\    ,    ^y'J/1+j-2-kt\  1  /kt 

W   \  J-l-t     j   (k-l)t+l    \ti         t*    I    j-t      j   (k-l)t+l   [t 

i-1 


_Lk;lj/i+j-l-kt\  1  fkf 

"  tt}   \     J-t      J   (k-l)t+l.  \t  I  . 


(**) 


If  y  >  1  then  r-f  =  tV  =  x»  and  a^  tne  summations  are  I  ( 
(**)  is  correct.   If  y  =  1,  we  have 


) ,  hence 


i-1 
x-1 


=  x, 


i-2 


k-1 


=  x  -  1, 


but  then  a  term  corresponding  to  t  =  x  in  the  second  summation  on  the  left 
side  of   (**)is  (J4*»  );  but  j  >  x,  so  this  is  0.  The  proof  is  thus  completed, 

O 


65 

As  for  the  unranking  algorithm,  it  can  be  found  in  a  'reverse' 
interpretation  of  theorem  4.2  (in  a  way  similar  to  the  one  by  which  the 
unranking  algorithm  of  section  3.5  was  built  from  the  ranking  algorithm  of 
section  3.4).  More  details  can  be  found  in  [Zaks,  1977b]. 

As  an  example  we  take  the  8-th  sequence  z=136  from  the  example 
in  p.  60  (the  12  sequences  in  Z(3,3)  ).  This  sequence  corresponds  to  the 
0,1  sequence  x=101001000  and  to  the  lattice  path  shown  in  figure  4.1  (p.  62). 

When  applying  the  ranking  algorithm  to  the  sequence  x,  we  get 
INDEX(x)   =  IWZ?ffX(  101 00 1000)  = 

=  a(2,3,3)  +  INDEX(]0]000) 

=  a(2,3,3)  +  a(0,2,3)  +  IWZ?£T(  1 00 ) 

=  a(2,3,3)  +  a(0,2,3)  +  1 

=  6+1  +  1 

=  8  . 
Note  that  the  numbers  used  in  computing  the  ranking  fuction  are  exactly  to  the 
left  of  the  corresponding  lattice-path  (see  figure  4.1),  plus  an  additional  1; 
Compare  this  with  the  similar  discussion  in  the  note  at  the  end  of  section  3.4 
(p.  43).   As  for  the  unranking  algorithm,  we  are  looking  for  the  8-th  sequence 
in  X(3,3).  In  [Zaks,  1977b]  this  example  is  discussed  based  on  the  unranking 
algorithm  studied  there.  Here  we  present  a  geometric  approach  to  this 
algorithm.  We  start  at  the  point  (6,3)  (see  figure  4.1)  and  go  to  the  left 
until  we  see  the  first  label  which  is  smaller  than  s=8;  it  is  a(2,3,3)  =  6, 
that  labels  the  point  (2,3),  in  this  case.  Hence  we  make  a  turn  at  the  point 
(3,3)  and  go  down  one  step,  and  set  s  <-   s  -  a(2,3,3)  =  2.  This  procedure  is 
now  repeated  until  we  reach  the  origin  (0,0):  We  go  to  the  left  until  we  meet 
the  first  label  which  is  smaller  than  s=2;  it  is  a(0,2,3)  =  1,  so  we  make  a 
turn  at  (1,2),  go  down  one  step  and  set  s  +-  s  -  a(0,2,3)  =  1;  the  path  is 
now  completed  through  (0,1)  to  (0,0). 
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4.4  GENERAL  CLASSES  OF  TREES:  GENERATION 

We  generalize  now  the  discussion  in  the  previous  chapter  for  binary 

trees  to  the  classes  T(K,N)  of  trees  that  have  n.  nodes  of  degree  k.  ,  i  =  1, 

1  t  n 

2,...,t  for  some  t,  and  nn+l  leaves,  where  nn  =  E  (k.-l)  n.  .  We  denote 

u  u  i=1   1     1 

K=(kQ,  k, ,  ...  ,  k.)  and  N=(n0,  n-. n.)  and  assume  throughout  this 

discussion  that  k.  >  k._,  >  ...  >  kn=0  .  For  t=l  we  get  the  classes  T(k,n) 

of  regular  k-ary  trees  with  n  internal  nodes.  The  trees  are  generated 

according  to  the  a-order  (section  2.4). 

Define  A(K,N)  to  be  the  set  of  integer  sequences  a  =  {a,-}?  that 

have  n.  occurrences  of  the  integer  k.  and  n«  O's  (the  k. 's  and  the  n.'s  are 

as  defined  above)  and  have  the  extended  dominating  property.  A  sequence 

a  =  {a.}1?   is  said  to  have  the  extended  dominating  orooerty  if  in  each  prefix 

1  t 

the  number  of  O's  is  not  larger  than  i     (k.-l)-(the  number  of  k.'s  in  the  prefix) 

i=l   n  ^ 

The  following  theorem  was  proved  in  [Chorneyko  and  Mohanty,  1975]  (using  the 

reverses  of  our  sequences): 

Theorem  4.4:  There  is  a  one-to-one  correspondence  between  the  trees  in  T(K,N) 

and  the  sequences  in  A(K,N). 

Proof:    To  see  this,  associate  with  a  tree  T  in  T(K,N)  the  sequence  a(T)  of 

degree  numbers  (see  p.  12).  Since  a(T)  is  built  in  a  preorder  traversal  of 

the  tree  T,  it  follows  that  the  father's  degree  appears  in  a(T)  before  his 

sons';  also,  a  tree  with  x.  internal  nodes  of  degree  y.  has  E(y.-1)»x.  +  1 

i 
leaves.  From  these  facts  we  deduce  that  a(T)  has  the  extended  dominating 

property.  Based  on  these  arguments  it  can  also  be  shown  that  this 
correspondence  is  one-to-one.  O 

Note  that  the  sequences  in  A(K,N)  are  also  in  A  (see  p.  13)  for 
n  =  zk.«n.  ;  in  other  words,  for  each  sequence  a  in  A(K,N)  we  have   I  a.  >_  i 
for  each  1 . 
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It  was  proved  in  the  Ordering  Lemma  (p.  23)  that  T  <  T'  if  and  only 

a 

if  a(T)  <a(T').   Therefore,  in  order  to  find  the  generating,  ranking  and 
unranking  algorithms  for  the  classes  T(K,N)  of  trees,  it  suffices  to  find 
these  algorithms  for  the  corresponding  integer  sequences  in  A(K,N).  In  this 
section  we  develop  the  generating  algorithm  for  A(K,N),  given  K  and  N,  and  the 
next  section  deals  with  the  ranking  and  unranking  algorithms. 

We  now  establish  a  one-to-one  correspondence  between  A(K,N)  and  the 
set  L(K,N)  of  lattice-paths  from  (nQ;  n,,  ...  ,  n.)  to  the  origin  (0,0, ...,0) 
that  do  not  pass  below  the  hyperplane  x~  =  I  (k.-l)  x.  :  Given  a  sequence  in 
A(K,N),  the  corresponding  path  starts  at  (nn,  n, ,  ...  ,  n.)  and,  for  each  j, 
if  a.  =  k  then  the  j-th  segment  is  in  the  m-th  direction  .  (All  segments 
are  directed  towards  the  origin  (0,0, . . .  ,0) . )  More  formally,  an  element  of 
L(K,N)  corresponds  to  a  sequence  of  lattice-points  L»L, ...L  ,  where   L~  = 


(nQ,  n} 


,  nt),  Ln  =  (0,0,. ..,0)  and  if  a.  =  km  and  L.  -,  =  (xQ,  x-j,...,xt) 


then  L.  =  (xQ,  ...  ,  x^ ,  xm-l,  xm+] ,  ...  ,  xt)  for  each  j. 

As  an  example,  consider  the  tree  in  figure  4.2  which  is  an  element  of 


a(T)  =3202000 


A  tree  in  T((0,2,3) ,(4,2,1 ) )  and  the  corresponding  sequence 

Figure  4.2 
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T(K,N),  where  K  (0,2,3)  and  N=(4,2,l);  In  other  words,  this  tree  has  one 
node  (n2=l)  of  degree  3  (k2=3),  two  nodes  (n-,=2)  of  degree  2  (k-^2)  and  four 
(n«=4)  leaves  (kQ=0).  Labeling  each  node  with  its  degree,  and  reading  these 
labels  in  preorder,  results  in  the  corresponding  sequence  in  A(K,N),  also 
shown  in  figure  4.2.  The  corresponding  lattice-path  from  (4,2,1)  to  (0,0,0) 
in  L(K,N)  is  shown  in  figure  4.3  . 


(4,2,1) 


xQ  =  X]  +  2x2 


A  path  corresponding  to  the  tree  of  figure  4.2 
Figure  4.3 
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Let  us  turn  now  to  the  generating  algorithm.   As  explained  before 
(p.  38),  given  a  sequence  a,  in  order  to  find  the  sequence  next  to  it  in  the 
lexicographic  ordering  we  scan  the  sequence  a  from  right  to  left  until  we  find 
the  first  (rightmost)  j  such  that  a.  <  a.+,.  Let  a  =  a,a?...a.a,  and 
suppose  there  are  m  occurrences  of  k   in  a  for  s  >_0.  To  get  the  next 
sequence  we  replace  a.  by  a  ,  the  smallest  positive  element  in  a,  and  append 
to  its  right  the  first  possible  subsequence;  this  subsequence  must  contain 
m'  occurrences  of  k  for  s  >  0,  where 


m 


i  _ 


m 


if  s  >.  0,  s  f   j,p 


<      m.  +  1   if  s  =  j 

ni  -  1   if  s  =  p, 
L   P 


The  first  such  subsequence  is  clearly 

,    k-,-1  mi    k9-l  mi        k.-l  m! 
od(k1o  '  )  ]  (k2o  l    )  2  •••  (kt0  t  )  t 

where   d  =  mi  -  E  m'*(k  -1); 
U     i   s   s 
S>J 

This  is  discussed  in  somewhat  more  detail  in  [Zaks  and  Richards,  1977]. 


It  is  clear  that  the  first  sequence  in  A(K,N)  is 


k,-l  n,     k9-l  n0       k.-l  n. 
(^0  ]  )  '  (k20  2  )  2  ...  (kt0  *  )  t 


and  that  the  last  one  is 
n 


t    nt-l      nl   Zi  (kr1),ni 

kk  kf) 

Kt   Kt-1  Kl   U 


The  following  algorithm  starts  with  the  first  sequence  in  A(K,N), 
and  proceeds  from  a  sequence  to  its  successor  until  it  reaches  the  last  one 
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Algorithm  GENERATE_T(K,N)  (generating  A(K,N)  lexicographically,  given  K  and  N) 


1 .  (first  sequence) 

k,-l  n,     k9-l  n9       k.-l  n. 

a  ■  a1a2"-ai:k.n.  *"  (kl°    )   (k2°    }      {kt°    )   • 
a~  «-  0  (a~  is  the  sentinel) 

2.  (scanning  the  current  sequence  a=  a.  ) 
j  +■  zk.ni  ; 

P  *■  kt+l  ; 

for  i  -*-  0  to  t  dp_  m.  +■   0  ; 

while  a.  ,  >  a .  do   begin 
J-l  -  J  —   — a — 

if  p  >  a.  >  0  then  p  «-  a 
m.  «-  m.  +1  ; 

J       J 


J  ' 


J  +  j-l  ; 

end 

if  j  =  0  then  stop  (last  sequence) 

(j  now  points  at  the  rightmost  i  such  that  a.  <  a.+,) 

(next  sequence) 

(we  have  m.  of  the  a.'s  equal  to  k.  to  the  right  of  a. 
l       it  J 

a   is  the  smallest  positive  number  to  the  right  of  a. 


aj"ap     ; 

*,  *  m.  ♦  1     ; 

nL  «-  m„  -  1     ; 
P         P 

ap+lap+2*"azkini 

4. 

goto  2  . 

mn-  z  m„(k  -1)    k,-l  m,    k0-l  m0      k.-l  m. 
0  °  «»  s  s   (^0  '  )  '(k20  2  )  2...(kt0  l     )   ) 


As  an  example,  the  21  sequences  in  A(K,N),  for  K=(0,2,3)  and 
N=(4,2,l),  corresponding  to  the  ordered  trees  with  one  internal  node  of  degree 
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3  and  two  internal  nodes  of  degree  2  (one  of  which  has  been  shown  in  figure  4.2), 
are  generated  by  this  algorithm  as  follows: 


Position 

Sequence 

1 

2020300 

2 

2023000 

3 

2030020 

4 

2030200 

5 

2032000 

6 

2200300 

7 

2203000 

8 

2230000 

9 

2300020 

10 

2300200 

11 

2302000 

12 

1320000 

13 

3002020 

14 

3002200 

15 

3020020 

16 

3020200 

17 

3022000 

18 

3200020 

19 

3200200 

20 

3202000 

21 

3220000 

(As  in  the  previous  examples,  we  underlined  the  rightmost  a-  discussed  in  the 
text,  and  we  omitted  the  sentinel  aQ=0.) 
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4.5  GENERAL  CLASSES  OF  TREES:  RANKING  AND  UNRANKING 


In  this  section  we  compute  the  function  INDEX(L)   that,  given  a 

path  L  £  L(K,N),  will  compute  its  corresponding  position  in  the  lexicographic 

ordering  of  L(K,N);  also,  given  an  integer  w,  we  construct  the  path  L  e 

L(K,N)  such  that  INDEX  (I)   =  w.  As  discussed  earlier,  the  paths  L  =  L-.  L, 

...  L  inL(K,N)  are  those  lattice  paths  from  the  point  (n«,  n, ,  ...  ,  n.) 

to  the  origin  which  do  not  go  below  the  hyperplane  xn  =  e  (k.-l)x-. 

0  i=1  l    i 

We  make  use  of  the  multinomial  coefficients 


d       \  J  0  if  any  d.  is  <  0 


dr  d2,  ...  ,  d£/   ^    dV.    ...  .  d^,   otherwise 

i 
where  d  =  2.-  d..  The  multinomial  coefficient  has  a  familiar  interpretation 

i=1  ] 
as  the  number  of  lattice  paths  from  the  point  (d, ,  d«»  ...  ,  d  )  to  the  origin. 

This  interpretation  gives  a  combinatorial  proof  of  the  following  lemma,  which 

is  also  easily  proved  directly  from  the  above  definition: 

Lemma  4.1  If  d  =2  d.  and  all  d.  are  integers,  then 

f "'      d      W  (  d  - 1  \ 

VdT  d2 Af^   \dT  d2'  •••  '  di-T  di  '  '"  d'+1'  •"  '  dV 

Let  C(nQ,n,,n2,  •••  ,n.)  denote  the  number  of  lattice  paths  from  the  point  (nQ, 

n, ,  ...  5  n.)  to  the  origin  which  do  not  go  below  the  hyperplane  x^  =  2  (k.-l)x-. 

i=l 

The  following  theorem  defines  these  entries  recursively,  and  solves  the 

recurrence  relation: 
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Theorem 4. 5:  The  solution  to  the  recurrence  relation 


C(nQfn1 ,n2,. ..  ,nt)  =  / 


^ 


Z   C(nn,n, ,...  ,n.-l 
j=0   u  '     J 


n.<0  for  i  =  1 ,2,.. . ,  or  t 

nQ  =  n^  =  ...  =  nt  =  0 

nn  =  ?  (k.-l)n.  -  1 
i=l  ] 

,n  )  otherwise 


is  given  by 

C(n(J,n1,n2,...>nt)  =  [n^ ,. . .  ,nt)   -  2  (k.-l  )(nQ+l  ,n]  ,. . .  ,n.-l ,. . .  ,nt 

where  n  =  n~  +  n,  +  ...  +  n. . 

Proof:  We  show  that  C(nQ,n.|,  ...  ,  nt) ,  as  given  by  (*),  satisfies  the  re- 
currence relation  and  the  boundary  conditions.  When  n.  <  0  for  i  =  1,  2,  . 
or  t  (*)  gives  the  value  0  by  definition.  The  case  nQ  =  n,  =  ...  = 

n,  =  0  is  taken  care  of  in  the  same  way.  When  nn  =  £  (k.-l)n.  -  1  and  no 

u       i=i     i         i 

n.   is  <  0,   (*)   can  be  rewritten  as 

C(VVn2 nt>  -  (Vl)!nJ!...nt!    ^V1     "  .*     'V""^ 

from  which  it  is  clear  that  C(n0,n,  , . . .  ,n  )  is  0  for  this  case.  If  nn,  n,, 
...  ,  n,  are  none  of  the  above,  we  prove  the  recurrence  by  induction  on  n. 
For  n  =  1,  (*)  is  correct.  We  assume  that  it  holds  for  any  m  <   n, 
and  take  n  =  n^  +  n,  +  ...  +  n,.  By  the  recursive  definition  of  C(n~,n,,. 


(*) 


,nj 


we  have 


C(n0,nr...,nt)  =  2  C^.n, ... .  ,n  -l,...,nt) 


.  :0 
For  each  of  the  terms  on  the  right,  we  use  (*),  by  the  induction  hypothesis, 

and  get  .  .       n-1 


t  / 


C(nQ,n1 ,.. .  ,nt)  =  I  \nQ,n1 , . . . ,n  .-1 , . . . ,nt 

•J   ^ 


\ 


t   t      ,  n-l 

"  js0  M  1"  \n0+l,n1,...,nrl1...,nJ-l,...,ntJ 
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t  .        n-1 

=  I 


j=0Vn0,nr...,nj-l,...,nt 


-  z  (k-i)  If  n_1  \ 


J 
which,  by  the  previous  lemma,  gives 

C(Vnl "t»  =(n0.n,,n..,nt)  "  J,  'V1 '  (  nQ+1  .„, .. .  .?„,-! ,. . .  ,„t ) 

as  desired.  /~\ 

This  recurrence  relation  has  been  solved  for  the  case  t  =  1  in 

previous  works:  For  k,  =2  it  was  solved  in  [Whitworth,  1878],  and  a  solution 

for  an  arbitrary  k,  is  found  in  [Yaglom  and  Yaglom,  1964].  A  solution  for  the 

1  t 

general  problem  for  points  on_  the  hyperplane  xn  =  z  (k.-l)»x_.  is  given  in 

u  i=l   n    ] 
[Chorneyko  and  Mohanty,  1975]  using  generating  functions.  See  also  [Richards, 

1978]  for  a  note  related  to  these  countings. 

Given  a  path  L  e  L(K,N),  we  show  now  how  to  compute  its  position 
INDEX [L)     in  the  lexicographic  ordering  of  L(K,N);  because  of  the  immediate 
relation  between  L(K,N)  and  A(K,N)  it  is  clear  that  this  will  also  give  us  a 
way  to  deal  with  the  sequences  in  A(K,N). 

Theorem  4. 6:  Let  a  =  a,ap...a  e  A(K,N)  and  the  corresponding  lattice-path 
L==  L0Lr..Ln  e  L(K,N),  where  Li  =  (y.Qi   y^,  ...  ,  y^).  Then 

n-1  Vr1 

INDEX {L)   =  1  +  Z    z   C(y.n,  y.,,  ...  ,  y..-l       y..) 
i=0  j=o     iu   i i        ij   ,  ...  ,  ix 

where  the  C( .,.,...,.  )'s  are  given  by  theorem  4.5  ,  and  the  b. 's  are  defined 

as  follows:  if  a.  =  k.  then  b.  =  j,  for  i=l,2 n  . 

(Note  that  since  it  is  only  the  relative  values  of  the  k.'s  that  is  important 
during  the  generation,  ranking  or  unranking,  for  the  listing  of  the  sequence 
in  A(K,N),  one  can  prefer  working  with  the  numbers  b.  rather  than  the  a.'s; 
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more  details  can  be  found  in  [Zaks  and  Richards,  1977]  .) 

Proof:  By  definition  we  know  that  all  the  sequences  that  begin  with  either  of 

kA,k,  ...  ,  or  k,   ,  will  come  before  this  sequence  a,  and  the  number  of  these 

0  ]  V1  bri 

sequences  is  indicated  by  the  inner  summation   2  .  This  follows  from  the 

j=0 

definitions  of  b.,  y. .  and  theorem  4.4  . 
i   ij 

Next,  we  know  that  all  the  sequences  that  start  with  k,  0,k.  k,,..., 

Dl   Dl  ' 
or  k,  k,   ,  will  come  before  the  sequence  a,  and  the  number  of  these  sequences 

bi  V1  b  -l 

is  indicated  by  the  summation   j  .  The  rest  follows  by  induction,  following 

J-0 
this  line  of  argument. 

The  constant  1  is  added,  as  in  the  previous  ranking  functions,  so 

that  the  indexing  will  begin  with  1  rather  than  0.  /-\ 

It  is  clear  from  the  proof  how  the  ranking  function  in  this  general 
case  extends  the  one  for  binary  trees  (see  note  in  p.  43). 

To  illustrate  this  procedure  we  refer  to  the  tree  and  path  in  figures 
4.2  and  4.3,  respectively.  In  figure  4.3  each  lattice  point  above  the  plane 
x0  =  x,  +  2x«  has  been  labeled  with  C(x0,x, ,x«),  the  number  of  ways  to  reach 
from  this  point  (x^x-pXp)  to  the  origin  (0,0,0)  while  not  passing  below  this 
plane.  We  go  along  the  path  and,  after  each  unit  step  that  we  make  in  a  certain 
direction,  we  add  all  the  labels  in  the  preceding  directions  (xA  preceds  x, 

and  x,  preceds  x«).  We  thus  get  for  the  path  L  in  our  example 

INDEX  (I)   =  1  +  12  +  5  +  0  +  2  +  0  +  0  +  0  =  20. 

As  for  the  unranking  algorithm,  we  will  follow  theorem  4.6  in  a 
reverse  interpertation.  We  are  given  a  number  i,  and  look  for  a  path  L  such 
that  INDEX  {I)   =  i.  We  first  show  the  unranking  for  the  previous  example: 
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Suppose  we  want  to  find  the  20-th  sequence  in  L(K,N),  where  K=(0,2,3)  and  N= 
(4,2,1).  Starting  at  the  point  (4,2,1)  we  sum  up  the  labels  in  directions  0, 
1,...  (see  figure  4.3),  in  that  order,  as  long  as  we  do  not  exceed  20-1=19. 
Here  we  take  C(4,l,l)=12,  which  corresponds  to  making  the  first  move  from 
(4,2,1)  to  (4,2,0),  or  a, =3  (the  first  move  has  been  done  in  the  third 
direction).  Starting  now  at  (4,2,0),  we  sum  up  the  labels  in  directions  0,1,. 
as  long  as  we  do  not  exceed  19-12=7;  here  we  take  C(3,2,0)=5,  which 
corresponds  to  making  the  second  move  from  (4,2,0)  to  (4,1,0),  and  so  on. 
The  unranking  algorithm  is  formally  described  as  follows;  the  proof  follows 
from  the  above  discussion. 

Algorithm  UNRANKJ(K,N)   (finding  the  w-th  sequence  in  A(K,N),  given  w, 

K  and  N  as  in  the  discussion  above) 

u  *■  w  -  1 ; 

(y0,yr...>yt)  -  (n0,nr...,nt); 

for  i  -  1  to  n  do 
begin 

(Find  the  largest j  such  that  sum  of  entries  in 

the  first  j  directions  does  not  exceed  u) 

j  «-  0; 

sum  «-  0; 

S  -  C(yQ-l,  yr  ...  ,  yt); 

while  sum  +  S  <_  u  do   begin  sum  «•  sum  +  S;  j  «-  j  +  1 ; 

S  -  C(yQ,  ...  tfj-1,  ...,,  yt); 


end; 


ai  *  kj  5 


y^  *■  yn-  -  'I  ; 

end. 


u  ■«-  u  -  sum  ; 
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CHAPTER  5 
A  GRAPH  LABELING  PROBLEM 


5.1   INTRODUCTION 


We 


are  given  n  tasks  t-, ,  t2 t   to  be  executed  on  two 


identical  processors,  that  share  a  common  memory.  Task  t.  requires  c. 
of  the  memory  for  its  execution,  for  each  i  (  0  <_  c  <_  1  ).  Two  distinct 
tasks  t.  and  t.  can  be  executed  simultaneously  on  the  two  processors  if 
and  only  if  their  total  memory  requirement  does  not  exceed  the  available 
amount  of  memory,  namely  if  and  only  if  c.  +  c.  <_  1.   This  situation 
gives  rise  to  an  undirected  graph  in  which  a  vertex  n.  is  associated  with 
the  task  t. ,  and  there  is  an  edge  connecting  two  distinct  vertices  n.  and 
n.  if  and  only  if  the  corresponding  tasks  t.  and  t.  can  be  executed 
simultaneously.  We  denote  by  GR   the  class  of  graphs  that  can  be  obtained 
in  this  way. 

In  other  words,  from  a  graph  theoretic  point  of  view,  we  are 
interested  in  the  class  GR   of  undirected  graphs  for  which  there  exists  a 
labeling  of  the  vertices  with  numbers  in  the  interval  [0,1]  such  that  two 
distinct  vertices  are  connected  by  an  edge  if  and  only  if  the  sum  of  their 
labels  does  not  exceed  1. 

When  extending  the  problem  to  the  case  where  there  are  s 
resources  shared  by  the  two  processors,  we  get  extensions  of  this  class  of 
graphs.  In  this  case  the  task  t.  requires  c..  of  the  j-th  resource 
during  its  execution,  for  i=l,2,...,n  and  j=l,2,...,s.  Tasks  t.  and  t. 
can  be  executed  simultaneously  if  and  only  if  their  total  requirement  of 
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the  j-th  resource  does  not  exceed  the  available  amount  of  this  resource  (for 
each  j),  namely  iff  c^   +  c-k  <_  1  for  k=l,2,...,s.  This  gives  rise  to  the 
classes  GRS  of  graphs  for  which  there  exists  a  labeling  of  the  vertices  with  s- 
dimensional  vectors  over  the  interval  [0,1]  such  that  two  vertices  are 
connected  by  an  edge  if  and  only  if  the  sum  of  their  labels  does  not  exceed 
(1, !,...,!).  It  turns  out  that  a  graph  is  in  GR   if  and  only  if  it  is  the 


intersection  of  s  graphs  from  GR  ,  and  that  GR1  is  properly  included  in 


GR1+1  for  i  >  1 


It  is  also  possible  to  extend  our  scheduling  problem  to  the  case 
when  we  have  m  >  2  processors;  In  this  case  a  set  of  (not  more  than)  m 
tasks  can  be  executed  simultaneously  under  the  same  conditions  (namely,  not 
using  more  than  the  available  amount  of  any  resource),  which  means  that  in 
this  case  we  have  to  deal  with  hypergraphs  (rather  than  graphs),  in  which 
the  vertices  correspond,  as  before,  to  the  tasks,  and  the  edges  are  those 
sets  of  at  most  m  vertices  that  correspond  to  sets  of  tasks  that  can  be 
simultaneously  executed. 

In  all  of  these  scheduling  problems,  it  can  be  expected  that  some 
knowledge  of  the  structure  of  the  graphs  will  be  helpful  if  one  wants  to 
find  schedules  of  minimal  total  execution  time  or  in  case  one  wishes  to 
analyze  heuristic  algorithms  for  these  scheduling  problems  (see  [Johnson, 
1974],  [Coffman,  1976]  and  [Ecker,  1977]). 

Another  motivation  to  study  these  classes  of  graphs  is  given  in 

[Chvatal  and  Hammer,  1977].  Given  a  system  of  linear  inequalities 

n 

z   a. .  x.  -  1       i  =  l,2,...,m,    (*) 
j=l   1J  J  " 

where  each  a.,  is  either  0  or  1 ,  they  want  to  find  out  whether  there  exists 
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a  single  inequality 
n 

I        C.  X.  •:  b  (**) 

j=l     J   J 

that  have  the  same  set  of  zero-one  solutions  as  (*).  (The  c's  and  b  are 
integers.)   Let  A  =  (a..)  be  the  matrix  containing  the  a^'s.  It  is 
clear  that  the  zero-one  solutions  of  (*)  are  completely  determined  by  those 
pairs  of  columns  of  A  that  have  a  positive  dot  product. 

Chvcital  and  Hammer  build  the  graph  G(A)  in  which  there  is  a 
vertex  corresponding  to  each  column  of  A,  and  two  distinct  vertices  are 
connected  by  an  edge  if  and  only  if  the  corresponding  two  columns  have  a 
positive  dot  product.  The  zero-one  solutions  of  (*)  correspond  to 
independent  sets  of  vertices  (a  set  of  vertices  is  independent  if  no  two 
of  the  vertices  in  it  are  connected  by  an  edge).  Therefore,  in  order  to  be 
able  to  'simulate'  (*)  by  (**),  there  must  be  a  labelinq  of  the  vertices 

of  the  graph  G(A)  with  numbers  c,,c« c   such  that  the  zero-one 

solutions  of  (**)  will  be  exactly  the  characteristic  vectors  of  independent 
sets  of  vertices  in  G(A). 

This  class  of  graphs,  known  as  threshold  graphs,  coincides  with 
the  class  GR   defined  above,  and  has  been  studied  in  the  literature  also  in 
connection  with  synchronization  problems  ([Golumbic,  1976],  [Henderson  and 
Zalcstein,  1977]).   From  Chvcital  and  Hammer's  point  of  view  a  graph  G  is  a 
threshold  graph  if  and  only  if,  when  regarding  the  subsets  of  the  n  vertices 
as  points  in  the  n-dimensional  unit  cube,  there  exists  an  hyperplane  Ec.x.=b 
seperating  the  independent  sets  of  vertices  from  the  other  sets  of  vertices. 
From  another  point  of  view,  a  graph  G  is  a  threshold  graph  if  and  only  if 
there  exists  an  integer  labeling  of  its  vertices  and  a  threshold  b  such  that 
two  vertices  are  connected  by  an  edge  if  and  only  if  the  sum  of  their  labels 
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exceeds  the  threshold  b  ([Golumbic,  1976]). 

Extending  their  problem,  Chvatal  and  Hammer  want  to  find  s 

inequalities 

n 

I     c..  x,  ib.       i=l,2,...,s       (***) 
j=l  1J  J    n 

such  that  (*)  and  (***)  will  have  the  same  set  of  zero-one  solutions.  For 

this  it  is  necessary  and  sufficient  that  the  graph  G(A),  defined  above,  will 

be  the  union  of  s  threshold  graphs.  It  follows  that  these  graphs  are  the 

complements  of  the  graphs  in  our  class  GRS 

In  this  chapter  we  first  discuss  the  class  GR  ,  reviewing  results 
from  [Chvatal  and  Hammer,  1977],  [Henderson  and  Zalcstein,  1977]  and 
[Golumbic,  1976],  and  extending  them,  and  following  this  discussion  we 

study  the  classes  GR  .   In  section  5.2  we  introduce  notations  and  basic 

1  s 

notions;  the  class  GR   is  treated  in  section  5.3,  and  the  classes  GR  ,  for 

general  s,  are  studied  in  section  5.4. 


5.2  PRELIMINARIES 

In  this  section  we  deal  with  undirected  graphs  without  self -loops 
or  multiple  edges.  Such  a  graph  is  described  by  a  pair  (V,E),  where  V  is  the 
set  of  vertices  and  E,  the  set  of  edges ,  is  a  set  of  unordered  pais  of 
vertices. 

Let  G=(V,E)  be  a  graph;  two  vertices  v,w  e  V  are  connected 
(by  an  edge),  or  adjacent,  if  (v,w)  e  E.  G  is  connected  if  for  any  two 
vertices  v  and  w  there  is  a  sequence  of  vertices  (v,,  v2,  ...  ,  v.)  such 
that  v-|=v,  vk=w,  and  (v. ,  v-+,)  e  E  for  i  =  l  ,2, . . .  ,k-l .   The  neighborhood 

of  v  e  V  is  N(v)  =  (w|(v,w)  e  E};  the  cardinality  DEG(v)  =   |N(v)|  is  called 
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the  degree  of  v     (for  a  set  A,   |A|    denotes  the  cardinality  of  A).     We  will 
also  use  a  modification  of  the  notion  of  the  neighborhood:   N'(v)=N(v)u{  v  }. 
V.  denotes  the  set  of  all  the  vertices  with  degree  i.     For  instance,  VQ 
contains  all   the  isolated  vertices.    DEG(Qt)  denotes  the  maximal   degree 
among  the  vertices  of  G.     The  complementary  graph  of  a  graph  G=(V,E)   is  a 
graph  GC=(VC,EC)  where  VC=V  and  Ec={  (v,w)|  v,w  e   V,  v  +  w,   (v,w)  ^   E  }.    Let 
V'eV;  the  subgraph  induced  by  V    is  defined  as  G'  =  (V',E'),  where  E'   contains 
these  edges  with  both  endpoints  in  V.     Instead  of  'subgraph  induced  by   ...  ' 
we  will   use  the  notion    'subgraph'    throughout  the  paper.     Graphs  G.=(V. ,E. ), 
i=l,2,  are  called  isomorphic  if  there  exists  a  bijection  <j>  :y.    ■*■  V«     such 
that  V  v,w  e    V, 

(v,w)  e    E1     **       (<J)(v),  *(w))  e    E2   . 

Let  s  e    N=   {1,Z,...}  .  A  function  R:   V    h.[0,1]S   (the  set  of  s- 
dimensional   vectors  over  the  interval   10,1]  )   is  called  an  s-labeling  of  G 
if 

(v,w)  £    E     «-      R(v)+R(w)^  (1,1,. ..,1). 

For  example,  the  function 

R1  :  v]  ■>  1/3,  v2  ■>  1/2,  v3  -  0,  v4  ->■  1 

is  a  1 -labeling  for  the  graph  G,  in  figure  5.1  .  For  the  graph  G^,  the 
function 

R2  :  W]  ->  (3/4,1/2),  w2  ->  (1/4,1/2),  w3  ■>  (1/2,3/4),  w4  ■>  (1/2,1/4) 

is  a  2-labeling;  one  can  easily  verify  that  no  1 -label ing  exists  for  G„  . 

As  will  turn  out  later,  for  a  fixed  s  e  N,  not  every  graph  has  an 
s-labeling.  Therefore,  it  is  of  interest  to  characterize  those  graphs  that 
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can  be  s-labeled;  we  denote  this  class  of  graphs  by  GR5 


'1  ' 


'2  ' 


A  graph  G-j     in  GR1 ,     and  a  graph  G^  in_GRJ 


but  not  in  GR 


Figure  5.1 


Clearly,  for  a  given  s  e  N  and  a  graph  G,  if  R  is  an  s-labeling 
for  G,  then  R  is  not  the  only  possible  s-labeling  for  G;  there  are  infinitely 
many  such  labelings.     On  the  other  hand,  given  a  set  V  of  vertices  and  a 
function  R:  V  •*■  [0,1]    ,  there  is  a  uniquely  determined  graph  G=(V,E)   for 
which  R  is  an  s-labeling;  its  set  of  edges  is  precisely 
E  -  {(v,w)  |    R(v)+R(w)  <  (1,1 1)   }. 


83 


5.3  1 -LABELED  GRAPHS 


,1 


In  this  section  a  complete  characterization  of  GR   is  given.  When 
studying  an  integer  programming  problem  (see  section  5.1),  [Chva'tal  and  Hammer, 
1977]  defined  a  class  of  graphs,  called  threshold  graphs,  which  are  identical 
to  GR  .  The  structure  of  these  graphs  is  also  investigated  in  [Henderson  and 
Zalcstein,  1977],  [Golumbic,  1976  and  1978].  These  results  are  summarized  in 
the  sequel  (theorem  5.1). 

We  say  that  a  graph  has  the  subgraph  property  if  it  does  not  contain 
an  induced  subgraph  isomorphic  to  2HL,  P*  or  C.  (see  figure  5.2). 


The  forbidden  graphs 
Figure  5.2 


A  graph  G=(V,E)  has  the  inclusion  property  if  (i)  or  (ii)  is  true 
for  all  v,w  e  V: 

(i)  If  (v,w)  i   E,  then  N(v)s  N(w)  or  N(w)s  N(v). 

(ii)  If  (v,w)  e  E,  then  N'(v)s  N'(w)  or  N'(w)s  N'(v). 
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Theorem  5.1 :   The  following  are  equivalent: 

(1)  G  e  GR1. 

(2)  G  has  the  subgraph  property. 

(3)  G  is  a  threshold  graph  ([Chvatal  and  Hammer,  1977]). 

(4)  G  has  the  inclusion  property. 

(5)  G  and  Gc  are  both  trivially  perfect  ([Golumbic,  1978]). 
Proof:  (1)  «-*  (3)  is  proved  in  [Golumbic,  1976],  (2)  *-*■  (3)  +-*  (4)  in 
[Chvatal  and  Hammer,  1977],  and  (2)  «-*  (5)  in  [Golumbic,  1978].    >-v 

We  now  describe  an  algorithm  that  1 -labels  a  given  graph  G  =  (V,E) 
in  GR1;  if  G  j.   GR1  it  results  in  a  1-labeling  of  a  graph  G'  =  (V,E')  in  GR1 
where  E'  a  E. 

Algorithm  1-LABELING 

(we  are  given  a  graph  G=(V,E),  |V|=n,  with  vertex  sets 
Vq,  V,,  ...  ,  V DEG(n\     as  discussed  above  (section  5.2), 
and  define  a  labeling  function  R:  V  ■*■  [0,1]) 

1.  V«-  V  ; 

for  i  «-  0  to  DEG(G)     do 

begin 

while  Vi  f   0  do 

2.  begin 

choose  v  e  V.  ; 

3.  R(v)  -«-  l-i/2n  ; 
while  N(v)n  V  f  t   do 

begi  n 
4t  choose  w  e  N(v) n  V   J 
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5.  R(w)  «•  1/2n  j 

6.  II  w  c  V .  then  V .  *  V,-{w]  > 


7. 

V  «■  V-{w>  ; 
end; 

8. 

V.  *  V.-{v}  ; 

9. 

V  «-  V'-{v}  ; 
end; 

end 

An  example  that  demonstrates  how  the  algorithm  works  follows  the 
proof. 

Proof  of  correctness:  Let  G=(V,E)  4  GR  ,  (u,u')  e  E,  and  suppose  the  algorithm 
labels  u  first.  If  u  is  labeled  in  step  3  by  l-j/2n  then  u'  is  labeled  in 
step  5  by  j/2n.  If  u  is  labeled  in  step  5  by  j/2n  then  u'  gets  a  label  <  l-j/2n. 
In  both  cases  R(u)+R(u')  <.  1,  hence  we  result  in  a  1-labeling  for  a  graph  G'= 
(V,E')  where  E'=  E. 

Let  now  G  e  GR^  .  Following  the  previous  discussion  it  is  sufficient 
to  show  that  (v,w)  f.   E  implies  R(v)+R(w)  >  1  .  We  consider  three  cases: 

(1)  R(v)=l-i/2n,  R(w)=l-j/2n. 
Then  R(v)+R(w)  >  1. 

(ii)  R(v)=l-i/2n,  R(w)=j/2n. 

If  j>i  then  R(v)+R(w)  >  1;  If  j  <_  i  then  ]u  e  V.  such  that  w  e  N(u) 
and  R(u)=l-j/2n,  DEG{u)=j   (due  to  the  algorithm).  From  R(v)=l-i/2n  follows 
DEG[v)=i.     Therefore  DEG(u)  <_  deg{m) .  Because  of  the  inclusion  property 
we  obtain  N(u)  £N(v).  Hence  w  e  N(v),  which  is  in  contradiction  to 
(v,w)  t   E. 
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(iii)  R(v)=1/2n,  R(w)=j/2n. 

There  exists  a  vertex  v'  e  N(v)  with  R(v')=l-i/2n  and  D£G(v')=i, 
and  similarly  a  w'  e  N(w)  with  R(w')=l-j/2n  and  DEG{vi')=j.     Suppose 
j  >_  1 ;  then  we  obtain  DEG(w)   >_  j  >_  i  =  DEG{v'),   from  which  we  get 
N(w)  =  N(v').  But  as  v  e  N(v')  we  have  a  contradiction.     _ 

The  following  example  demonstrates  how  the  algorithm  1 -LABELING 
works.  The  sets  of  vertices  of  the  same  degree  in  the  graph  of  figure  5.3  are 

V{v7}»  Vl=0»  V2={V'  V3={V1'V2'V'  V0'  V{v3}'  V{v5'v6}  ' 

For  i=0,  R(v7)=i  according  to  the  steps  2  and  3  of  the  algorithm. 

If  i=2,  R(v4)=l-2/2n  =  14/16  (steps  2  and  3),  and  R(v5)=R(v6)=2/16  (steps 

4  and  5).  Similarly,  for  i=3  we  obtain  R(v1)=13/16,  R(v3)«3/16,  and  R(v2)= 

R(v8)=13/16. 


v1  (13/16) 


v2  (13/16) 


(13/16) 


v3  (3/16) 


v4  (14/16) 


v6  (2/16) 


v5  (2/16) 


An  example  to  algorithm  1 -LABELING 
Figure  5.3 
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From  the  way  the  algorithm  is  designed,  the  following  is  an  immediate 
consequence: 

Corollary:   For  any  graph  G=(V,E)  e  GR   there  exists  an  1  -label ing  R:  V  -*■  [0,1] 
for  which  R(v)  f   0  for  each  v  e  V. 

We  next  study  the  structure  of  GR  .  First  we  state  a  lemma  which  is 
proved  in  [Henderson  and  Zalcstein,  1977]. 
Lemma  5.1:   Let  G=(V,E)  e  GR1 .  Then 

(1)  If  G  contains  no  isolated  vertices  then  G  is  connected. 

(2)  If  G  is  connected  then  there  exists  a  vertex  with  degree  |V|-1. 


The  following  theorem  describes  the  structure  of  GR  : 


Theorem  5.2 


i  n   i 

(1)  The  number  of  non-isomorphic  graphs  in  GR   with  n  vertices  is  2 

(2)  Furthermore: 

2    of  them  are  connected, 

2    of  them  have  one  connected  component  with  n-1  vertices 

and  1  isolated  vertex, 

n-4 
2    of  them  have  one  connected  component  with  n-2  vertices 

and  2  isolated  vertices, 


n-k 

have  one  connected  component  with  n-k+2  vertices  and 

k-2  isolated  vertices, 


1  has  one  component  with  2  vertices  and  n-2  isolated  vertices, 

and 
1  has  n  isolated  vertices. 
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Proof:  As  can  be  easily  seen,  for  the  graphs  in  figure  5.4  the  theorem  holds 


n=l  :   • 
n=2  :   • 


• • 


n=3 


:.  /.  l 


Theorem  5.2 
Figure  5.4 

Assume  it  holds  for  n.  According  to  the  previous  lemma  we  can  get  all  graphs 
in  GR  having  n+1  vertices  by  taking  the  following  two  disjoint  sets  of  graphs 

(1)  All  the  graphs  in  GR  with  n  vertices  and  an  additional  isolated 
vertex  to  each,  and 

(2)  All  the  graphs  in  GR  with  n  vertices  and  an  additional  vertex  - 
connected  to  all  the  n  vertices  -  to  each. 

The  construction  from  n=3  to  n=4  leads  to  the  graphs  of  figure  5.5  . 


(1) 


•   • 


•   • 


i:  l* 


(2) 


71    Vi    0 


Theorem  5.2  (continue) 
Figure  5.5 
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Our  construction  is  valid  following  the  discussion  in  the  previous  sections. 

Then  it  is  obvious  that 

_  i 

(a)  the  set  of  graphs  obtained  in   (1)  contains  2  "     non-isomorphic  graphs. 

The  same  is  true  for  the  set  of  graphs   (2).     As  the  sets  obtained  in 
(1)  and  (2)  are  disjoint,  there  are  2*2n~  =2n  classes  of  non-isomorphic 
graphs  in  GR,  each  having  n+1   vertices. 

(b)  From  (2)  we  know  that  GR     contains  2n"  =2^n+  '       non-isomorphic 

graphs  with  n+1  vertices  which  are  connected.     Suppose  G  has  n  vertices, 

n-2-k 
and  k  of  them  are  isolated,  0<k<n-2.     If  there  are  2  such  non- 

isomorphic  graphs  in  GR1   then  there  are  2n'2"k=2^n+1 )_2"(k+1 )  non- 
isomorphic  graphs  in  GR    with  n+1   vertices,  where  k+1   of  them  are 
isolated. 
This  completes  the  proof. 

O 

Note:    The  number  2    of  graphs  in  GR   is  also  given  in  [Chva'tal  and 
Hammer,  1977]  . 

Following  lemma  5.1  a  one-to-one  correspondence  between  the  graphs 

in  GR  with  n  vertices  and  the  binary  numbers  with  n-1  digits  is  now 

established  : 

Given  a  binary  number  x=a,ap...a  _,  ,  where  each  a.  is  either  0  or  1 , 

we  build  the  graph  G(x)  =  (V,E)  with  V={vQ,  v] ,  ...  ,  vn_.,}  and  we  connect 

v.,  1  <_  j  <  n,  to  v.  ,  i  <  j,  if  and  only  if  a.  =  l. 
J  i  J 

For  example,  let  n=8  and  x=1011001;  we  get  the  graph  G(x)  shown  in 
figure  5.6. 

Thus  we  have  proved  the  following  theorem: 
Theorem  5.3:  For  any  integer  n>0,  there  is  a  one-to-one  correspondence  between 
the  graphs  in  GR  with  n  vertices  and  the  binary  sequences  of  length  n-1. 
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x=a,a2a3a.a5a6a7=1011001 


V0  Vl 

•^ • 


V6# 


•  v, 


•  v. 


6# 


v5  V4 

Step     (a) 
a,=l  ■*  connect  v,   to  v~ 


a3=1 


v5  v4 

Step     (b) 

connect  v3  to  v . ,  i<2 


v7# 


a4=l 


connect  v-  to  v. ,  i<3 


Constructing  the  graph  G(x) 
Figure  5.6 
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A  vertex  v.  is  connected  to  all  the  vertices  v.,  j>i,  if  and  only  if 

a.=l,  and  this  contribution  to  deg{v.)     is  clearly   z  a.  .  Also,  v..  is 
J  j>i  J 

connected  either  to  all  or  to  none  of  the  v.'s  for  j<i,  depending  on  whether 

J 

or  not  a.=l.  Therefore  we  have: 

Corollary:  Given  x  and  G(x)  as  discussed  above,  the  degrees  DEG{v.)   are 

determined  by 

DEG(v.)   =  ia.  +  z     a.  ,  i=0,l ,. . . ,n-l  . 
j>i  J 


The  adjacency  matrices  corresponding  to  graphs  in  GR  have  interesting 
properties  which  we  may  also  use  to  determine  the  number  of  non-isomorphic 
graphs  in  GR  with  n  vertices  each. 

Let  G=(V,E)  e  GR1,  V=tVl,...,vn>  ,  and  R:  V  ^  [0,1]   be  a  1-labeling 
for  G  such  that  R(v-])<  R(v2)  <...  <  R(v  ).  Define 

AQ  =  {  v  |  ve  V,  R(v)  <  1/2  }  =  {  vr  v2,  ...,  vk  >  , 

A]  ={v|  Ve  V,  R(v)  >  1/2  }  =  tvk+1,Vk+2,...,Yn  >  . 
If  R( v.+-j )+R( v.  )  <_  1  we  could  change  R(v.+,)  to  1/2  and  transfer  v.+1  to  AQ. 
It  is  then  clear  that  R(v. +1 )+R(vk+2^  >  !•  So  we  assume  without  loss  of 
generality  that  in  AQ  and  A,  as  defined  we  have  r(vic+-i)+r(vi<)  >    1- 

As  RCv^^  R(vi+1)  we  have  deg(Vj  )  >  2?^(vi+1)  and  N(v.)2N(v.+1) 
for  i=l ,2,. . .  ,n-l  (this  follows  from  definitions;   see  also  the  remark 
preceding  lemma  5.2).  We  also  have 

v,w  E  AQ  ^  (v,w)  e  E, 
v,w  e  A-,  -*■  (v,w)  f    E. 

Following  this  discussion,  we  conclude  that  the  adjacency  matrix 
of  G  must  have  the  following  structure: 
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/ 


'k+1 


k+2 


'n   V 


Vl  Vz 


an  arbitrary  lattice 
path  connecting  A  and  B 


V. 


all  the  entries 
are  1 


all  the  entries 
are  0 


n  -  k 


0/ 


\ 


/ 


>^ 


Sk-l 


J 


The  number  of  these  matrices  for  given  n  and  k  is  equal  to  the 
number  of  paths  from  A  to  B,  i.e. 


rtr")  ■  cd 


From  the  structure  of  the  matrices  it  follows  that  G, ,G2  e  GR 
result  in  the  same  matrix  iff  they  are  isomorphic.     Therefore,  the  number  of 
non-isomorphic  graphs  in  GR    with  n  vertices  each  is 

i-l    . 


I  (-1)  ■ 2"' 
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One  may  ask  whether  any  graph  G=(V,E)  in  GR   can  be  labeled  by  a 
two-valued  function  R:  V  ->  {0,1}  .  However,  this  is  not  true,  as  can  be 
easily  seen.  Furthermore,  we  show  that  no  finite  set  of  labels  is  sufficient 
for  labeling  all  graphs  in  GR  .  We  do  this  by  using  the  following  property 
of  1 -label ing: 
Lemma  5.2:  Let  R  be  a  1-labeling  for  G=(V,E)  e  GR1 .  Then 

R(v)  <  R(w)  «-►  DEG(v)  >  DEG(w) 


for  all  vertices  v,w  e  V.  Moreover,  if  R(v)=R(w)  then  DEG{v)=DEG{vi) . 
Proof:  Follows  immediately  from  the  definition  of  1-labeling.      ,-^ 

Theorem  5.4:  No  finite  number  of  distinct  labels  suffices  for  labeling  all 
the  graphs  in  GR  . 

Proof:   For  G  e  GR  we  know  that  R(v)=R(w)  -*■    DEG{v)=DEG(w)     (lemma  5.2). 
So,  given  t  e  N  we  construct  a  graph  G.  in  GR  that  has  t  distinct  degrees 
of  vertices,  and  therefore  it  must  contain  t  distinct  labels. 

The  grap+i  G.  has  t+1  vertices  v, ,  v?,  ...  ,  v.+,  ;  the  label  on 
v.  is  i/(t+2).  It  follows  that  the  degrees  of  the  various  vertices  are 
given  by 

'  t+l-i  i  £ 

t+2-i  i  > 


deg{v.)  =  ' 


t+1 


2 
t+1 


For  each  t  e  N,  G   cannot  be  1-labeled  with  less  than  t  distinct  labels, 


which  proves  the  theorem. 


O 
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5.4  S-LABELED  GRAPHS 

A  detailed  characterization  of  graphs  in  GR  ,  for  general  s,  similar 
to  that  for  GR   as  discussed  in  the  preceding  section,  is  much  more 
complicated.  Unfortunately,  we  do  not  have  such  a  characterization  for  s>l . 

However,  we  present  some  interesting  facts  about  these  classes.  First  note 

s    s+1 
that  we  clearly  have  GR  c  GR    for  every  s,  since  an  s-labeling  of  a 

graph  can  easily  be  extended  to  an  (s+1 )-labeling  by  adding  a  new  all -zero 

component. 

1     2 
Theorem  5.5:  The  hierarchy  GR  £  GR  £  ...   is  infinite  (namely,  all  the 

inclusions  are  proper). 

Proof:  We  show  that  for  every   s>l  there  exists  a  graph  G  which  is  in  GR  but 

not  in  GRS_1   (Gs  e  GRS  -  GRS_1). 

Let  K  denote  the  complete  graph  with  n  vertices,  and  let  V  and  V" 

be  two  disjoint  sets  of  s  vertices  each.  We  define  G   to  be  the  graph 

K'  u  K"  ,  where  K'  and  K"  are  the  complete  graphs  with  vertex  sets  V  and 

V",  respectively  (see  figure  5.7). 


The  graph  G, 
Figure  5.7 
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(a)     Gs  e   GRb. 


The  function  R:  V  ■*    [0,l]s   with  the  components  R,,...,R     defined  in  the 
following  table     is  an  s-labeling  for  G       (v  e  (0,1/4)   ): 


vl 

v2 

•    •    • 

vs 

wl 

w2 

•   •   • 

ws 

R1 

1-1) 

V 

•    •    • 

V 

l-2v 

2v 

•   •   • 

2v 

h 

V 

1-v 

.  .  . 

V 

2v 

l-2v 

.  .  . 

2v 

• 

9 

• 

• 

*s 

V 

V 

.  .  . 

1-v 

2v 

2v 

.  .  . 

l-2v 

(b)     Gs   /  GR5"1. 

The  proof  is  based  on  a  decomposition  of  the  vertex  set  of  k-labeled 

graphs.     We  first  demonstrate  this  decomposition.     Let  G=(V,E)  e  GR  , 

and  let  R  be  a  k-labeling  for  G.     We  decompose  the  vertex  set  V  into  2 

disjoint  subsets  Un,...,U  L       :   if  a.  ...a-,   is  the  binary  representation 


J0 


2k-l 


of  i   e  {0, ,2-1}   ,   i.e. 


define 


k-1 
i  =     s 
j=0 


2Ja, 


j+l    ' 


U.   =  {V  e  V   |    R,(v)   <  1/2  ^     a.=0,  j=l 

vj  J 


k} 


The  following  properties  of  vertices  in  U.   can  be  immediately 
derived     (for the    case  k=2  the  decomposition  of  V  is  shown  in  figure  5.8) 

(1)  Vv  e   UQ:     R(v)  <   (1/2 1/2);  hence   (v,w)   e  E     Vv,w  e  UQ. 

(2)  V  1   e  {l,...,2k-l}   ,  \/v,w  e  U.:     R(v)+R(w)  I  (l,...l);  hence 

(v,w)  t  E. 
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(3)  Let  i,j  e  {0,. . .  ,2-1}  ,  v  e  U. ,  w  e  U . ,  and  (v,w)  e  E.  Then, 

if  a.  ...a,  and  b.  ...b,  are  the  binary  representations  ofi  and  j, 

respectively,  we  have  a  =0  or  b  =0  for  all  y  e  {!,... k>  . 

Consequences  of  (1),  (2)  and  (3)  are:  If  G  contains  a  complete  subgraph 

K,  then  the  vertices  are  distributed  among  the  sets  Un,...,U  ,    such 

u     2-1 
that  each  of  the  sets  U.  with  i>0  contains  at  most  one  vertex  of  K.  If, 

however,  G  contains  two  disjoint  complete  subgraphs  K'  and  K",  then  the 

vertices  of  one  subgraph,  say  K' ,  must  be  members  of  UQ  only,  whereas 

the  vertices  of  K"  are  distributed  among  U, ,...,U  .   .  According  to 

1     2-1 
properties  (2)  and  (3),  K"  cannot  contain  more  than  k  vertices. 


U 


u. 


(  >  1/2,  >  1/2) 


U 


0 


(  <  1/2,  <  1/2) 


(<  1/2,  >  1/2) 

-a 

(  >  1/2,  <  1/2)  J 


Decomposition  of  the  vertex  set 
Figure  5.8 
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s-1 
This  result  is  now  applied  to  the  graph  G$.  Suppose  Gs  is  in  GR   .  As 

G  contains  two  disjoint  complete  subgraphs  with  s  vertices  each,  the 

vertices  of  one  subgraph  cfre  distributed  among  the  sets  U,  ,...U   ,  . 

1    2s  -1 
However,  as  this  graph  cannot  contain  more  than  s-1  vertices,  we  get  a 

s-1 
contradiction.  Therefore  G  I   GR   ,  which  completes  the  proof. 

The  following  lemma  shows  that  the  graphs  in  GR  with  a  fixed  set 
of  vertices  may  be  viewed  as  some  kind  of  basis  for  all  graphs  with  the  same 
set  of  vertices: 

c 

Lemma  5.3:  A  graph  G=(V,E)  e  GR   if  and  only  if  there  exist  s  graphs 

G.=(V,E.)  e  GR1,(i=l,2,...,s),such  that   G  =  n  G.   (where  nG.  =  (  nV. ,  nE.)> 

i  i      i  n  i  "" 

(V,  nE.)  ). 

i 

Proof:     Let  R=(R] R$):  V  +[0,l]s    be  an  s-labeling  for  G.     If  (v,w)  e  E 

then  R^vJ+R^w)  <  1     for  all   i   e  {l,...,s}   .     Define     G.  =  (V,E.)     where 
Ei   =  ((v,w)    |   Ri(v)+Ri(w)  <  1}   .     Then 

(v,w)  e  E  ^  R(v)  +  R(w)  <  (1,...,1) 

^    Vie  (  !,..., s  }  Rn.(v)  +  R^w)  <  1 

^  V1  £  {  l,...,s  }  (v,w)  eE.  ,  Q 

3 

As  an  example,  for  the  graph  in  figure  5.9  the  function  R:V  ■*■   [0,1] 

defined  by 


vertex  R  =  (R^.Rg.R-) 

^  ■*-  (3/4,1/4,1/2) 

v2  ^  (1/4,3/4,1/4) 

v3  ■>  (1/2,1/4,3/4) 

v4  -  (1/2,1/2,1/4) 

v5  ->  (1/4,1/2,1/2) 


is  a  3- labeling. 
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Each  of  the  components  of  R,  namely  R, ,  R„  and  R3,  defines  a  1-labeling  for 
a  graph  G1=(V,E1)  E  GR1 ,  i  =  l,2,3,  and  G1  n  G2  n  63  =  G.  The  graphs  G] ,  G2 
and  G~  are  shown  in  figure  5.10  . 


An  example  for  the  intersection  property 
Figure  5.10 

It  is  clear  from  the  definition  of  GR  that  every   complete  graph  is 


in  GR  .  However,  from  theorem  5.5  we  know  that  a  graph  that  consists  of  two 
disjoint  complete  graphs  of  the  same  size  s>l  is  contained  in  GR   but  not  in 


GR 


s-1 


.  The  following  theorem  generalizes  this  idea: 
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Theorem  5.6:  Let  seN,  G=(V,E)eGRs,  and  let  G'=(V',E')  be  a  complete  graph 
with  s  vertices,  where  V'n  V  =  0.  Then  Gu  G'  e  GR  . 

Proof:  For  G  one  can  always  find  an  s-labeling  R:  V  -+[0,1  ]s  where  Vv  e  V 
R(v)  ^  (0,...,0):  According  to  lemma  5.4  G  can  be  represented  as  intersection 
of  graphs  G.  =  (V.  ,E. )  e  GR  ,  where  V.=V  for  i=l,...s.  For  each  of  the  graphs 
Gi  there  is  a  1-labeling  R.. :  V  +  [0,1]  such  that  R^v)  t   0  for  all  v  e  V 
(corollary , p. 87).  Hence  the  combined  function  R=(R-, ,. . .  ,R  ) :  V  -*  [0,l]s  is 
an  s-labeling  for  G  which  has  the  desired  property. 

Let  G'  =  (V'  ,E')  be  a  complete  graph  and  V'^w, w  }  ,  Vn  V  =  0. 

The  function  R':  V  ->  [0,l]s,  defined  by  R':  w,  ->  (0. .  .010. .  .0) ,  j=l,...s, 

3  3 

defines  an  s-labeling  for  G'.  Then,  for  G"  =  Gu  G'  =  (Vu  V  ,  Eu  E1),  the 

function  R":  Vu  V  -*  [0,1  ]s  ,  defined  by 


l"(v)  =  | 


R(v)   if  v  e  V, 
R'(v)   if  V  e  V, 


is  an  s-labeling.     In  fact,  as  R(v)  has  non-zero  components  only,  we  have 
RM(v)+R"(w)  I  (1,...,1)  for  every  v  e  V  and  w  e  V.  Q 

For  instance,  as  an  immediate  result,  the  graph  in  figure   5.11     is 

2 

a  member  of  GR  . 


•    •    » 


A  graph  which  is  in  GR  (by  theorem  5.6) 
Figure  5.11 
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It  is  easily  shown  that  for  any  graph  6  e  GR   also  its  complement 

c       1 
G  is  in  GR  .  It  follows  that  the  class  of  s-threshold  graphs,  defined  in 

[Chvatal  and  Hammer,  1977]  as  unions  of  s  graphs  in  GR  (see  also  section  5.1), 

coincides  with 

{  G  |  Gc  e  GRS}  . 

Hence  the  class  of  s-threshold  graphs  differs  from  GRS  for  s>l. 

For  example,  the  graph  in  figure  5.12  is  in  GR  ,  whereas  the 

comlementary  graph  is  of  the  type  of  graphs  G  (see  theorem  5.5),  and 

2 

therefore  is  not  in  GR  . 


2 
A  graph  in  GR  ,  whose  complement  is  not 

Figure  5.12 

In  general,  the  problem  of  deciding  whether  a  given  graph  is  in  GR  , 
for  given  k,  seems  to  be  much  more  difficult  than  the  case  where  k=l.  It 
follows  from  [Chva'tal  and  Hammer,  1977]  that  this  decision  problem  is  an 
NP-complete  problem  (see  [Cook,  1971],  [Karp,  1972]). 

Lemma  5.4:   Let  G=(V,E)  e  GRS,  w  i   V,  V  c  V,  and  connect  w  to  all  the 

s+1 
vertices  in  V.  Then  the  graph  thus  obtained  is  in  GR 

Proof:  Let  R=(R,,  R2>  ...  ,  Rs)  be  an  s-labeling  for  G.  An  (s+1 )-labeling 

R1  is  defined  as  follows: 
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R'Cv)  =  i 


,  Rs(v),  0  )  for  v  e  V 


(  R^v),  R2(v),  . 

(  R^v),  R2(y),  ... 

(  0   ,0   ,  ...  ,  0   ,  1   )  for  v=w. 


Rc(v),  1/2  )  for  ve  V-V 


o 


A  recursive  application  of  lemma  5.4  yields  the  following: 
Corollary:  If  a  graph  G  has  n  vertices  then  G  e  GRn. 

For  example,  applying  this  procedure  to  the  pentagon  in  figure 
5.9  ,  we  get  the  following  labeling  of  its  vertices: 


vertex     R  =  (R-,  ,R2,R3,R4,R5) 


v1  ■*  (1/2,  0  ,1/2,1/2,  0  ) 

v2  ->  (  n  ,  1  ,  0  ,1/2,1/2) 

v3  ■*  (0,0,1,0  ,1/2) 

v4  ->  (0,0,0,1,0) 

v5  -*  (0,0,0,0,1) 

3   2 
Lemma  5.5:  Every  cycle  with  at  least  five  vertices  is  in  GR  -GR  . 

Proof:  A  path  P  ,  of  length  n-1  is  easily  shown  to  be  in  GR  ;  one 

possible  2-labeling  is  shown  in  figure  5.13.  Using  lemma  5.4  it  follows 


n-1 


n-1   1 


-)  ( 


n+r  n+r  Vr  n+r  Vr  n+i 


n-2v    #n-3 


-) 


'n-1 


A  2-1abeling  for  P 
Figure  5.13 


n-1 
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that  the  cycle  Cn  with  n  vertices  is  therefore  in  6R  .  One  can  show  that 
C5  i   GR  (in  [Ecker  and  Zaks,  1977]  a  theorem  is  given  that,  in  certain 

instances,  might  help  in  deciding  that  a  certain  graph  cannot  be  s-labeled, 

2  2 

given  s);  We  now  show  that  C   for  n  >  5  is  not  in  GR  .  Suppose  C  e  GR  ; 

then  by  lemma  5.3  there  exist  graphs  G',  G"  e  GR   such  that  C  =  G'  n  G". 
Since  G'  and  G"  are  both  connected,  they  must  contain  vertices  v'  and  v", 
respectively,  of  degrees  n-1  each  (by  lemma  5.1).  If  these  two  vertices  are 
not  adjacent  on  C   then  the  edge  connecting  them  is  in  both  G'  and  G",  hence 
it  is  in  G'  n  G",  a  contradiction.  If  these  two  vertices  are  adjacent  on  C 
then  we  get  the  configuration  shown  in  figure  5.14  .  We  now  prove  that  the 


C  cannot  be  2-labeled 

— n 

Figure  5.14 


edge  (u',u")  must  be  in  both  G1  and  G".  In  G'  we  have  edges  e-,  and  e3  that 
form  a  forbidden  subgraph.  Hence,  according  to  theorem  5.1,  G'  must  contain 
the  edge  (u',u").  Similarly,  the  edges  e~  and  e,  form  a  forbidden  subgraph  in 
in  G",  therefore  (u',u")  must  be  also  in  G".  Therefore  (u',u")  must  be  in 
G'  n  G",  a  contradiction.    In  both  cases  we  have  a  contradiction,  and  this 
completes  the  proof.  r\ 
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In  contrast  to  the  subgraph  property  of  GR  (theorem  5.1),  a  similar 

subgraph  property  for  GRS,  based  on  a  finite  number  of  forbidden  graphs,  does 

2 

not  exist:  following  the  lemma  just  proved,  for  the  class  GR   we  have 

infinitely  many  pairwisely  non-isomorphic  forbidden  graphs,  namely  all  the 
cycles  of  length  at  least  five  (note  that  any  induced  subgraph  of  C  is  in 
GR2). 

We  conclude  by  further  examples.  By  direct  inspection  it  can  be 
verified  that 

1.  all  graphs  with  13  vertices  are  in  GR  , 

2 

2.  all  graphs  with  1  4  vertices  are  in  GR  , 

3 

3.  all  graphs  with  <.  6  vertices  are  in  GR  , 

4.  the  only  graphs  with  four  vertices  that  are  not  in  GR  are 

those  in  figure  5.2,  and 

2 

5.  the  only  graphs  with  ±   6  vrtices  that  are  not  in  GR  are 

those  in  figure  5.15  . 
More  enumeration  results  are  found  in  the  following  table: 


Total 

Number 

Number 

Number 

Number 

Number 

number 

of 

of 

of 

of  graphs 

of 

of 

graphs 

graphs 

graphs 

in  GRS 

vertices 

graphs 

in  GR1 

in  GR2 

in  GR3 

s  <_  4 

1 

1 

1 

1 

1 

1 

2 

2 

2 

2 

2 

2 

3 

4 

4 

4 

4 

4 

4 

11 

8 

11 

11 

11 

5 

34 

16 

33 

34 

34 

6 

156 

32 

133 

156 

156 
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V 


All  graphs  with  <6  vertices  not  in  GR* 
Figure  5.15 
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CHAPTER  6 
EVGE  LABELWGS   FOR  TREES 

6.1   INTRODUCTION 

Given  n+1  cities  that  are  directly  connected  by  a  tree-like 
communication  network,  and  n  communication  lines  of  different  values  of  a 
certain  property,  we  want  to  assign  these  communication  lines  ('weights') 
to  the  direct  connections  ('edges')  of  the  qiven  network,  optimizing 
certain  objective  functions.  In  case  this  property  represents  capacity  or 
reliability,  we  would  like  to  maximize  our  objective  functions,  and  in  case 
it  represents  loss  or  vulnerability  we  would  like  to  minimize  them. 

Our  objective  functions  are  of  various  types.  For  example,  the 
function  SUMDIS  is  measuring  the  average  communication  cost  of  the  network 
(see  T_Hu,  1974]  and  [Lenstra,  Rinnooy  Kan  and  Johnson,  1976]).  The  diameter 
is  measuring  the  maximal  communication  cost  in  the  network,  and  the  radius 
is  measuring  the  maximal  communication  cost  from  a  directory  optimally 
located  in  the  network  .  Let  c(e)  and  v(e)  denote  the  capacity  and 
vulnerability  of  a  communication  line  assigned  to  the  edge  e,  respectively. 
Define  the  capacity  and  vulnerability  between  the  vertices  i  and  j  as 

c(i,j)  =  min  {c(e)}  and  v(i,j)  =  max  (v(e)}, 
where  e  is  on  the  path  connecting  i  and  j.  Then  the  functions  SUMMIN   and 
SUMMAX  measure  the  average  capacity  and  vulnerability  of  the  network  . 

In  section  6.2  we  present  several  NP-complete  problems,  concerning 
optimizing  the  radius  and  the  diameter  of  a  tree  and  of  a  binary  weighted 
(0,1  weights)  general  graph.  Polynomial -time  algorithms  for  special  cases 
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of  minimizing  the  radius  of  a  tree  are  shown  in  section  6.3.  In  section  6.4 
we  present  polynomial-time  algorithms  for  optimizing  average  measurement 
functions  (e.g.,  SUMMAX).     Open  problems  are  found  in  section  6.5. 

6.2  PRELIMINARIES 

Let  T»(V,E)  be  a  tree  with  vertices  V={1 ,2,. . . ,n+l }  and  edges 

E={e-|,  e^,...,  e  }.  A  tree  is  considered  to  be  unrooted,  unless  otherwise  stated 

A  vertex  of  degree  one  is  a  leaf,  and  an  edge  incident  with  a  leaf  is  a  terminal 

edge.  P  denotes  a  path  with  n  vertices.  W={w, ,  w«,. . . ,  w  }  is  a  set  of 

weights  such  that  0  <  w.„  *w,  <  w0  <  •••   wMawm,x/  .  A  labeling  of  the  edges 
a —  mm  1—2—    —  n  max    -*■       3 

of  T  with  weights  from  W  is  a  bijection  f:  E  -  W.  In  this  chapter  we  study 
label ings  that  optimize  certain  objective  functions. 

Let  p(i,j)  denote  the  (unique)  path  connecting  vertices  i  and  j  in 
T.  Given  a  labeling  f  of  T,  the  distance  d^(i,j)  between  i  and  j  is 

df(i,j)  =    Z  f(e)  . 

eep(i.j) 

The  diameter    Df(J)     of  the  tree  T,  labeled  with  f,  is 

Df(l)  =      max  {df(i  ,j)}     . 
i<j 

Note:  We  omit  the  references  to  f  and  T  whenever  possible;  i.e.,  we  use  the 

notations  d(i,j)  and  D   instead  of  dJi.j)  and  DJT) ,  respectively,  and 

similarly  for  the  functions  defined  in  the  sequel. 

A  center  of  a  labeled  tree  is  a  vertex  i  for  which  max  (d(i,j)} 

J7i 
is  minimal,  and  this  value  is  called  the  radius  R  of  the  tree;   i.e., 

R  *  min  {max  {d(i  ,j)}}  . 
1    J 
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The  terms  diameter,  radius  and  center  are  similarly  defined  for  labeled 

undirected  graphs.  In  a  rooted  tree  the  center  is  located  at  the  root. 

The  maximum  weight  and  minimum  weight  on  p(i  ,j)  are 

max(i.j)  =  max        {f(e)>  and   m1n(1,j)  =  min    {f(e)>  , 
eep(i.j)  eep(i  ,j) 

respectively. 

I  p ( i » j )|   is  the  number  of  edges  in  p(i,j). 

is  the  average  weight  on  p(i,j). 

The  quantities  which  we  optimize,  for  a  given  tree  T,  over  all 
possible  label ings  f,  are  the  following: 

1.  D   , 

2.  R  , 

3.  SUMDIS{J)  =     e     d(i,j)    , 

i<j 

4.  SUMMAX{J)  =     E    max(i ,j)    , 

i<j 

5.  SUMMIN{T)  =     e     min(i.j)    ,     and 

i<J 

6.  SUMAW{1)  =     e     avr(i  ,j)     . 

i<j 
The  functions     (3)-(6)     are  average  measurements , si  nee  the  summations  are 
over  all   pairs  of  vertices. 
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6.3  DIAMETER  AND  RADIUS:  NP-CQMPLETE  RESULTS 

In  this  section  the  following  problems  are  shown  to  be  NP-complete 
(see  [Cook,  1971],  [Karp,  1972],  and  the  book  by  [Garey  and  Johnson,  1979]): 

PROBLEM  1 :  Given  a  tree  T  with  n  edges,  a  set  W  of  n  non-negative 
weights,  and  an  integer  k>0,  to  determine  whether  there 
exists  a  labeling  f  with  diameter  <_  k. 
PROBLEM  2:  Like  PROBLEM  1,  for  radius  <  k. 

This  result  holds  also  for  rooted  trees. 

PROBLEMS  1  and  2  remain  NP-complete  when  the  weights  are  bounded 
PROBLEM  3:  Like  PROBLEM  1,  for  radius  >  k. 

Maximizing  the  diameter  of  a  tree  is  trivial. 
PROBLEMS  1 ,  2  and  3  are  NP-complete  also  for  the  corresponding 
vertex  labeling  problems. 
PROBLEM  4:  Given  a  connected  graph  G  with  m  edges  and  a  set  of  0,1 
weights  W,  |W|=m,  to  determine  whether  there  exists  a 
labeling  with  radius  <_  1. 
PROBLEM  5:  Given  a  connected  graph  G  with  n+1  vertices  and  a  set  of 
0,1  weights  W,  |W|=n,  to  determine  whether  G  contains  a 
spanning  tree  which  can  be  labeled  such  that  its  diameter 
<  2. 


The  reductions  are  from  the  PARTITION  problem  and  the  MAXIMUM 
TERMINAL  SPANNING  TREE  problem,  both  known  to  be  NP-complete  (see  [Karp,  1972] 
and  [Garey  and  Johnson,  1979]): 
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PARTITION:  Given  positive  integers  a^ ,  i=l,2,...,n,  to  determine 
whether  there  exists  a  set  Ic{l,2,...,n>  such  that 


z  a.  =  Z   a.  . 
1el     Ul 


MAXIMUM  TERMINAL  SPANNING  TREE:  Given  a  graph  G  and  an  integer  k>0, 
to  determine  whether  G  has  a  spanning  tree  with  at  least 
k  terminals  (=1  eaves). 


Proof  for  PROBLEM  1 : 

We  show  that  PARTITION  is  reduced  to  PROBLEM  1.  Given  a.,  i=l,2,...,n, 
(an  instance  of  PARTITION),  we  define  the  following  instance  of  PROBLEM  1: 

?A 


^^  n  edges 


C 


W  =  {a,,  a«,  ...,  a,  0,  0,  ... ,  0,   z.  a.}  . 

L  V        ^  4 


n  O's 


k  =  f  i  a, 


We  prove  that  there  exists  a  solution  to  the  PARTITION  problem  iff  there  exists 

a  labeling  of  T  with  diameter  <_  k. 

Suppose  there  exists  a  solution  to  the  PARTITION  problem,  namely 

z     a.  -     z     a.   for  some  Ic{l ,2,. . .  ,n}  .  The  following  labeling  has  a 
iel  n   Ml     1 
diameter  k:  label  AB  with  Z.  a.  ,  spread  the  a.'s  for  iel  and  i^I  on 

BC  and  BD,  respectively,  and  label  the  rest  of  the  edges  with  O's. 

Suppose  there  exists  a  labeling  with  diameter  <_  k.  One  can  find 

such  a  labeling  in  which  za.  labels  the  edge  AB  (if  in  the  given  labeling 

AB  is  labeled  with  a.,  for  some  t,  and  Za.  labels  another  edge  e,  then  by 


no 


interchanging  la.  and  at  the  diameter  is  not  increased).  This  means  that  the 
a.'s  on  BC  and  on  BD  must  sum  up 
to  the  given  PARTITION  instance. 


a.'s  on  BC  and  on  BD  must  sum  up  to  j  2a.,  each,  hence  we  have  a  solution 


O 

Proof  for  PROBLEM  2: 

We  prove  that  PARTITION  is  reduced  to  PROBLEM  2.  Given  a.  as 
above,  we  define  the  following  instance  of  PROBLEM  2: 

T  =  P2n+1   • 

W  =  {a-,,  a0,  ... ,  a  .  0,  0,  ...,  0}    ,  and 

i   c  n  >     „     * 

,1  n  0's 

k  =  j  sa-  • 

The  rest  follows  immediately.  r\ 

The  same  reduction  holds  in  the  case  when  this  path  is  rooted  1n  its 
center,  which  proves  that  PROBLEM  2  is  NP-complete  also  for  rooted  trees. 

As  was  pointed  by  [Johnson,  1978],  PROBLEM  1  and  PROBLEM  2  remain 
NP-complete  when  the  weights  are  bounded,  and  this  is  shown  by  a  similar 
reduction  from  3-PARTITION  (see  [Garey  and  Johnson,  1979]). 

Proof  for  PROBLEM  3: 

We  prove  that  PARTITION  is  reduced  to  PROBLEM  3.  Given  a.  (as  above), 
we  define  the  following  instance  of  PROBLEM  3: 

T=Pn+2  • 

W  =  {a-. ,  a«.  ...»  a  ,  x}  where  x>max  {a.. }   ,  and 

1  1 

k  =  x  +  p  *sa.  . 

If  there  exists  a  solution  to  PARTITION  then  the  labeling  shown  in 
figure  6.1  has  a  radius  R   =  k. 


Ill 


/ 

a.'s  for  iel     *-x      a. 's  for  i^I 


Proof  for  PROBLEM  3 
Figure  6.1 

If  there  exists  a  labeling  with  radius  #>_  x  +  j  Za,- »  then  ^et  D  De 
a  center  and  DB  a  radius  (see  figure  6.2).  Therefore 

d(D,B)  >_  x  +  j  za.  ,  and  hence  d(A,D)  <.  ^  za. . 


A 

•- 

— •  •  • 

Proof  for 

o  y 

•  •— 
PROBLEM 

C 

-♦ 

3 

B 

(continue) 

1 

rigure  6 

.2 

Also  d(D,B)  <_d(A,C),  otherwise  D  is  not  a  center.  Therefore 

x  +  \  za-j  1  d(D,B)  <_  d(A,C)  <_  y  +  d(A,D)  <  y  +  j  Eai   , 
hence  x<y.  But  y<x,  hence  y=x,  and 

d(A,D)  =  d(B,C)  =  j  Zai   , 
which  solves  the  PARTITION  problem. 


o 


PROBLEM  1 ,  PROBLEM  2     and  PROBLEM  3     are  also  NP-complete  for  vertex 
labeling.     The  reductions  are  from  the  corresponding  edge  labeling  problems. 
Adding  a  weight  0  to  the  given  weights,  a  solution  for  the  vertex  labeling 
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problem  Induces  a  solution  for  the  corresponding  edge  labeling  problem.  This 
is  done  by  regarding  the  vertex  labeled  0  as  a  root  of  the  tree  and  using  a 
label  of  any  other  vertex  to  label  the  edge  directed  to  this  vertex. 

Proof  for  PROBLEM  4: 

We  prove  that  MAXIMUM  TERMINAL  SPANNING  TREE  1s  reduced  to  PROBLEM  4. 

Given  a  graph  G  with  n+1  vertices  and  k>0  (an  instance  of  MAXIMUM  TERMINAL 
SPANNING  TREE),  we  define  the  following  instance  of  PROBLEM  4:  G  is  the  same 
given  graph  and  W  contains  n-k  O's  and  all  the  rest  l's.  We  prove  that  G  has 
a  spanning  tree  with  at  least  k  leaves  iff  G  has  a  labeling  with  radius  R<\ . 

Suppose  G  has  a  spanning  tree  with  at  least  k  leaves.  This  yields 
a  labeling  of  G  with  radius  R <1 ,  by  labeling  the  internal  edges  (of  the  tree) 
with  O's  ,  and  thus  eyery   internal  vertex  (of  the  tree)  is  a  center  for  the 
graph  G  with  radius  2?<1 . 

Suppose  there  exists  a  labeling  of  G  with  radius  #<_  1 .  Let  A  be  a 
center  of  the  labeled  graph,  and  apply  a  shortest-path  algorithm  from  A  to  get 
a  'shortest-path  spanning  tree'  with  radius  R<_  1.  The  tree  is  labeled  with  at 
least  k  l's.  In  this  tree  there  exists  at  most  one  edge  labeled  1  on  each  path 
from  A  to  any  leaf,  since  the  radius  is  <  1.  Therefore  the  number  of  leaves  in 
the  spanning  tree  is  at  least  as  the  number  of  l's  in  the  tree,  i.e.  >  k. 

O 

Proof  for  PROBLEM  5: 

The  reduction  is  similar  to  the  one  of  PROBLEM  4.        ,-* 

Conjecture:   The  following  problem  is  also  NP-complete:  Given  a  connected 
graph  and  a  set  of  0,1  weights,  to  determine  whether  there  exists  a  labeling 
with  diameter  Z?<1 . 
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We  have  the  following  observations  about  this  problem.  Suppose  the 
graph  G  is  labeled  with  the  given  weights  such  that  its  diameter  D<] .  It  is 
clear  that  z?=0  iff  G  contains  a  spanning  tree  with  all  edges  labeled  0.  Thus 
the  interesting  case  is  when  z?=l.  In  this  case  we  may  assume  that  the  edges 
labeled  with  0  generate  a  forest  in  G,  since  if  they  generate  any  cycle  we  can 
replace  one  0-edge  with  a  1-edge  without  increasing  the  diameter  (putting 
this  0-edge  in  another  arbitrary  place  not  closing  a  cycle  of  0's).  Let  T.= 
(V^.E^),  i=l,2,...,j  be  the  trees  in  this  forest.  A  graph  G'  =  (V\E')   is 

constructed  from  G  by  contructing  (see  [Harary,  1969])  all  the  vertices  of 
each  tree  T.,  i  =  l,2,...,j,  into  one  vertex  u. .  It  can  be  shown  that  G  is 
labeled  with  a  diameter  I>=1  if  and  only  if  G1  is  a  complete  graph.  This 
observation  lead  us  to  this  conjecture. 

6.4  RADIUS:  POLYNOMIAL  RESULTS 

In  the  previous  section  we  proved  that  minimizing  the  diameter  or 
the  radius  in  a  tree  are  NP-complete  problems.  Now  we  present  polynomial - 
time  algorithms  for  special  cases  of  these  problems. 

PROBLEM  6:  Given  a  tree  T  and  a  set  of  0,1  weights  W,  to  find  a  labeling 
with  minimal  radius. 
The  following  algorithm  solves  this  problem  in  0(n)  running-time: 
Algorithm  MIN_RADIUS_I 

1.   while  (number  of  leaves  in  T)  <_  (number  of  1's  in  W) 
do   begin 

label  all  the  terminal  edges  with  1  ; 
delete  the  terminal  edges  from  T  ; 
delete  the  used  Vs  from  W  ; 
end  ; 
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2.   Put  all  the  remaining  l's  on  terminal  edges; 
label  all  other  edges  with  O's. 
Proof:  Follows  immediately.  /~\ 

Algorithm  MIN_RADIUS_I  is  applicable  also  for  rooted  trees. 

PROBLEM  7:  Like  PROBLEM  6,  for  a  set  of  a,b  weights,  a<b. 

The  following  algorithm  solves  this  problem  in  O(nlogn)  running- 
time  in  the  case  when  T  is  a  rooted  tree.  An  0(n  logn)  algorithm  for  unrooted 
(free)  trees  is  obtained  by  applying  this  algorithm  from  every   vertex  in  the 
tree. 

Algorithm  MIN.RADIUS.II 

1.  Label  all  edges  of  T  with  a's; 

Insert  all  terminal  edges  into  a  'candidate  list'; 

2.  while  W  contains  at  least  one  b 

do   begin 

find  an  edge  (x,y)  in  the  'candidate  list',  which 
lies  on  a  path  of  minimal  length  from  the 
root  to  a  leaf  ; 
label  (x,y)  with  b  ; 
delete  this  b  from  W  ; 
delete  (x,y)  from  the  'candidate  list'  ; 
if  x  has  no  sons  labeled  with  a 
then  begin 

find  the  unique  w  such  that  (w,x)eT  ; 
insert  (w,x)  into  the  'candidate  list'  ; 
end  ; 
end . 
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Proof:  Let  T  be  the  tree  labeled  by  the  algorithm  while  T'  is  an  optimally 
labeled  tree  minimizing  the  radius  and  satisfying  the  condition  that  if  a 
(directed)  edge  (x,y)  is  labeled  with  b  then  all  edges  in  the  subtree  T' 
rooted  at  y  are  labeled  with  b  (all  the  b-weights  are  pushed  towards  the 
leaves).  Note  that  T  satisfies  this  condition  by  the  algorithm. 

First  we  show  that  there  exists  such  an  optimal  labeled  tree  T1 . 
Let  T"  be  an  optimal  labeled  tree  not  satisfying  this  property.  Denote  by 
d,  d'  and  d"  distances  in  T,  T'  and  T",  respectively  (note  that  all  these 
three  trees  differ  only  by  their  labelings).  There  exists  an  edge  (x,y) 
labeled  b  in  T"  while  some  edge  (w,z)  in  T"     is  labeled  with  a.  Exchanging 
the  weights  of  the  edges  (x,y)  and  (w,z)  yields  a  labeling  where  d"(r,u)  is 
decreased  by  b-a  for  e^/ery     ueT"-T"  and  is  not  changed  for  any  other  vertex 
in  the  tree.  Repeating  this  process  yields  a  labeled  tree  T1 ,  satisfying 
the  desired  property,  without  increasing  the  radius,  and  therefore  V  is  also 
optimal . 

Assume  now  that  the  tree  T  labeled  by  the  algorithm  is  not  optimal, 
i.e.,  there  exists  a  vertex  xeT  such  that  d(r,x)>f?,  where  R     is  the  radius 
of  T' .  Then  the  path  from  r  to  x  contains  an  edge  labeled  with  b  in  T  and 
labeled  with  a  in  T' .  Let  (y,z)  be  the  first  such  edge  from  the  root  r.  On 
the  other  hand  there  exists  an  edge  (u,v)  labeled  with  b  in  T'  and  labeled 
with  a  in  T.  By  the  property  of  T'  all  the  edges  in  the  subtree  V  are 
labeled  with  b.  Let  w  be  a  vertex  in  T'  with  maximal  distance  d'(r,w)  . 
d'(r,w)<i?  since  R   is  the  radius  of  T' .    This  discussion  is  illustrated  in 
figure  6.3  . 
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An  illustation  for  algorithm  MIN_RADIUS_I I 
Figure  6.3 


But  in  T 


d(r,w)  <.d'(r,w)  +  (a-b)  <_  R  +   (a-b) 

since  the  edge  (u,v)  is  labeled  with  a  in  T.  On  the  other  hand,  before  the 
algorithm  labeled  (y,z)  in  T  with  b,  we  had 

d(r,x)  >  r  +  (a-b)  . 

Hence  the  algorithm  should  have  labeled  the  edge  (u,v)  with  b  before  labeling 
the  edge  (y,z),  a  contradiction. 

Therefore  the  labeling  of  T  is  optimal.  /-\ 

PROBLEM  8:  Like  PROBLEM  7,  for  a  maximal  radius. 
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In  a  rooted  tree  this  is  done  by  finding  a  longest  path  from  the  root, 
labeling  its  edges  with  as  many  b's  as  possible,  and  labeling  the  rest  of  the 
edges  arbitrarily.     In  an  unrooted  tree  this  is  done  by  finding  a  longest  path, 
labeling  it  with  ■  fx/2l    b's  on  one  end  and    |x/2j    b's     on  the  other  (where  x  is 
the  number  of  b's),  and  labeling  the  rest  of  the  edges  arbitrarily.     Both 
algorithms  are  of  0(n)  running-time. 

PROBLEM  9:     Given  a  tree  T  and  a  set  of  0,1  weights  W,  to  find  a  labeling 
with  minimal   diameter. 

This  is  done  in  0(n)  running-time  algorithm,  identical   to  that  of 
PROBLEM  6. 

6.5     AVERAGE  MEASUREMENTS:   POLYNOMIAL  RESULTS 

In  this  section  we  present  polynomial-time  algorithms  for  optimization 

problems  concerning  the  functions  SUMDIS ,  SUMAVR,  SUMMIN  and  SUMMAX. 

PROBLEM  10:     Given  T  and  W,  to  find  a  labeling  that  maximizes     or 

minimizes     SUMDIS(T)  =     z     d(i,j)   . 

i<j 

Following  [Hu,   1974]     we  observe  that  instead  of  summing  the  numbers 
d(i,j)'s     we  can  count  the  number  of  occurrences  of  each  edge  in  the  summation. 
Suppose  that  an  edge  e  partitions  the  tree  T  into  two  subtrees  with  k  and  n+l-k 
vertices,  respectively.     Then  e  is  counted  k(n+l-k)   times  in    SUMDIS{J).     We 
call     this  value     c(e)  =  k(n+l-k)     the  connection     of  e.     Now 

swiDisfi)  =     z  c(e)-f(e) 
e 

and  the  next  theorem  follows: 
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Theorem  6.1 :  Given  the  connections  of  the  edges  of  the  tree  T,  c(e,)  <_c(e«) 
<_  •••  <_c(en),  the  labeling  of  e..  with  w.  (  w n+i  j  )»  for  each  i,  maximizes 
(minimizes)  the  function  SUMDIS(J). 


It) 

^•><  N  <   n!  . 


The  optimal  labeling  is  unique  up  to  permuting  edges  with  equal 
connections.  Assuming  that  all  weights  are  distinct,  the  number  N  of  optimal 
labelings  satisfies 

2 
The  bounds  are  achieved  for  the  graphs  P  .,  and  K,   . 

The  connections  can  be  calculated  in  0(n)  time  by  regarding  T  as 
rooted  and  traversing  it  in  postorder.  Therefore,  and  because  of  the  sorting 
step  involved,  the  algorithm  for  optimal  labeling  (for  either  maximizing  or 
minimizing  SUMDIS{1)   )  has  an  O(nlogn)  running-time. 

PROBLEM  11:  Given  T  and  W,  to  find  a  labeling  that  maximizes  or 
minimizes  SUMAVR(J)   =  E  i  /I'M  i   • 

i<j  TpTujIT 

The  solution  is  essentially  similar  to  the  solution  of  PROBLEM  10. 
The  only  change  is  that  the  connections  c(e)  are  replaced  by  the  average 
connection  c(e)  defined  by 

c(e)  =   E  c,(e)/j  , 
J   J 

where  c.(e)  is  the  number  of  paths  of  length  j  through  the  edge  e. 

J 

Calculating  the  average  connections  is  done  by  a  breadth-first  search 

(see  [Aho,  Hopcroft  and  Ullman,  1974])  from  every   edge  e,  getting  the  number 
of  paths  of  length  i  on  each  side  of  e  (for  each  i),  and  then  'merging'  the 
information  from  both  sides  of  e  by  a  convolution  doable  in  O(nlogn)  time 
(see  [Aho,  Hopcroft  and  Ullman,  1974]).  Therefore  the  algorithm  in  this  case 
is  of  0(n  logn)  running-time. 
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PROBLEM  12:  Given  T  and  W,  to  find  a  labeling  that  minimizes 

summax(1)  =  z    max(i,j)  . 

The  problem  of  maximizing    SUMMAX{1)     is  still  open.     Similarly,  our 
algorithm  for  solving  PROBLEM  12  solves  also  the  problem  of  maximizing     the 
function    SUMMIN{T),  while  the  problem  of  minimizing    SUMMIN{1)     is  still   open. 
Conjecture:     The  problem  of  maximizing  (minimizing)     SUMMX{J)     (  SUMMIN{J)   ) 
is  NP-hard. 

The  algorithm  to  solve  PROBLEM  12  is  based  on  the  following  theorem: 

Theorem  6.2:     Given  T,W  as  above,  and  assuming    w.<w.+,  for  each  i,  we  have; 

1.  In  any  labeling  that  minimizes     SUNMAX{1)     the  largest  weight 

w       labels  a  terminal   edge. 
n 

2.  For  any  terminal  edge  there  exists  an  optimal   labeling  in  which 

w    labels  this  edge, 
n 

3.  min     SUMMAXJl)       =     z  i-w.        (which  means  that  this  optimal 

f  f  i         ' 

value  is  independent  on  the  tree  T). 

Note:     The  result  holds  also  for  the  case    0  <_  w,   <_  •  •  •   <_  w  ,  and  the 

modification  of  the  proof  is  not  discussed  here. 

Proof:     We  prove  the  theorem  by  induction  on  n.     The  theorem  holds   for  n=l,2 
and  3  .     Assume  it  holds  for  a  tree      with  less  than  n  edges.     Let  T  be  a  tree 
with  n  edges.     Assume  f  is  a  labeling  that  minimizes     SUMMAX{1) ,  and  that    w 
labels  a  non-terminal  edge    e,  thus  partitioning  T  into  two  subtrees  T,   and  T„ 
containing     k  and  n+l-k  vertices,  respectively,  for  some     2  <_  k  <_  n-1.       Wlog 
we  assume  that    w  _,     labels  an  edge  in     T, .     Applying  the  inductive  hypothesis 
to  T,  we  know  that  there  exists  an  optimal   labeling  for  T,     in  which  a  specified 
terminal  edge  is  labeled  with  w    , .     We  choose  this  edge  to  be  such  that 
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removing  it  with  its  corresponding  leaf  will  not  disconnect  T  (see  figure  6.4) 


n+l-k  vertices 


The  tree  T  and  the  corresponding  subtrees 
Figure  6.4 


Note  that  all  the  paths  from  vertices  in  T, ,  containing  w  ,  are  not  effected 
by  this  new  labeling  of  T,  (since  on  them  w  is  maximal),  and,  as  for  T, ,  the 
value  of  SUMMAX  is  not  changed  by  the  inductive  hypothesis.  In  the  current 
labeling  the  contribution  of  w   and  w  ,  to  SUMMAX{T)     is 


k(n+l-k)wn  +  (k-l)wn_1 


(1) 


By  Interchanging  w  ,  and  w   we  get  a  labeling  f  in  which  their  contributi 


on 


is 


nwn  +  (k-l)(n+l-k)wn_1 


But  substracting  (1)  from  (2)  gives 


(2)  -  (1)  =  (k-l)(n-k)(wn_1  -  wn)  <  0 

because   l<k<n  and  w„  ,<w„  ,  a  contradiction. 

n- 1  n 


(2) 
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We  know  that  in  an  optimal  labeling  of  T  w   labels  a  terminal  edge 
It  is  clear  that  the  contribution  of  w  to  SUMMAX[T)     is 

nwn  (3) 

It  is  also  clear  that  we  can  label  any  terminal  edge  with  w  ,  and,  after 
erasing  it  together  with  the  corresponding  leaf,  we  apply  the  inductive 
hypothesis  to  the  remaining  tree,  getting  optimal  labeling  with  a  value  of 

n-1 

z      iw.  » 
i-1   n 

n 
hence  for  T  we  get  the  optimal  value  of  z   iw.  .  /-\ 

1-1  1  ^ 

Following  the  above  proof,  it  is  clear  how  to  get  an  0(n)  labeling 

algorithm  that  minimizes  SUMMAX(T). 

As  for  the  number  of  optimal  label ings,  it  can  be  shown  that,  when  all 

the  weights  are  distict,  the  number  N  of  such  labelings  satisfies 

2n_1  <  N  <  n!  , 

when  the  bounds  are  achieved  for  the  graphs  P  .,  and  K,  . 

PROBLEM  13:  Given  T  and  W,  to  find  a  labeling  that  maximizes 
SUMMAX{T). 
We  stated  our  conjecture  (above)  that  this  problem  is  NP-hard.  As 
a  matter  of  fact,  the  problem  is  still  open  even  in  the  special  case 


when 


the  tree  T  is  a  path  (Pn+i).  Note  that  the  'natural'  approach  of  labeling 


the  edge  with  maximal  connection  with  the  largest  weight  w  is  not  optimal 
for  example,  consider  the  case  when  T=Pfi  and  W={1 ,2,3,8,9}  . 
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We  consider  now  the  case  when  T  is  a  path  and  W  contains  a  bounded 
number  of  distinct  weights.  We  present  an  exact  description  of  the  solution 
when  W  contains  only  two  kinds  of  weights,  and  give  an  optimal  labeling 
algorithm  for  the  case  when  W  contains  a  bounded  number  of  distinct  weights. 

A  path,  two  weights 

Here  T=P  ,  ,  and  W  contains  k  weights  b  and  n-k  weights  a,  a<b. 
The  structure  of  the  solution  is  characterized  in  the  following  theorem: 
Theorem  6.3:  Let  T  and  W  be  as  above,  and  denote  a  labeling  f  as  a  sequence 

xn  xi     xt-i  xt 
a  uba  'b-.-ba  z   'ba  z  (4) 

where  x.  >_  0  and   Ex.  =  n-k.   If  the  labeling  f  is  optimal, 
then 

lxi  "  xi  I  1  1     f°r  every  i  and  j. 

Proof:  For  a  labeling  function  f  as  described  in  (4),  Let  N^(a)  (N^(b)) 
denote  the  number  of  paths  p(i,j)  in  which  the  largest  label  is  a  (b).  It 
is  clear  that  by  minimizing  N,.(a)  over  all  possible  labelings  f  we  will  also 
maximize  SUMMAX(T)   over  all  possible  labelings  f,  since  a<b  and 

Nf(a)  +  Nf(b)  =  (n2])  • 
It  is  clear  that 


v,  ■  m  •  r:) 


fx.+r 

+  ...  +  |  t    .   (5) 


Assume  that  f  is  an  optimal   labeling  and  that  the  theorem  doesn't  hold.     Wlog 
we  may  assume  that 

xQ  >  x^l     .  (6) 
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We  define  a  labeling     f     as  follows: 


xn-l  x,+l  x9       xt  -|  x. 

f  :    a  °  ba  '  ba  ^b  •••  ba  z   'ba  z  (7) 

By  (5)  we  have 

v-ra-  r;i ra  •  <•• 

Using  (5),  (8)  and  (6)  it  can  be  shown  that 

Nf,(a)  <  Nf(a)  , 

hence  SUMMAX* .  (T)  >  SUMMAX^ (T) ,  a  contradiction.  q 

Using  this  theorem  it  is  clear  how  one  should  'spread'  the  a's  among 
the  b's  in  order  to  get  a  labeling  that  maximizes  the  function  SUMMAX. 

A  path,  k  weights 

Here  T=pn+i  »  and  W  contains  m.  copies  of  w,  for  i=l,2,...,k, 
where   m,  +  nu  +  •••  +  m.  =  n  and  w,  >  w«  >  •••  >  w.>  0.  The  following 

dynamic  programming  approach,  proposed  by   [Megiddo,  1978]  ,  solves  the 

2k+l 
problem  in  0(n    ).   Let  T  be  a  path  of  length  zs.  labeled  with  s.  copies 

of  w.  for  i=l,2,...,k,  where  the  leftmost  label  is  wa  and  p.  is  the 
1  a      i 

position  of  the  first  occurrence  of  w.   (from  the  left).  Denote  by 

4>  (  s-j,  s2,  ...  ,  sk,  wa>  p.j,  p2,  ...  ,  pk  ) 

the  maximal  value  for  SUMMAX(T) .     Note  that  p  =1. 

a 

If  there  exists  b  such  that    p.  =2     then 

<j>(s1,...,sk,wa,p1,...pk)  =     max  {<|>(sl ,. . .  ,sa_1  ,sa>l  ,sa+1 sk'wb'pl  "  *  *  'pk^  + 

pa 

♦  (a)   } 
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where  p\   =  p.j-1  for  i>a  and  i|;(a)  is  the  contribution  of  the  leftmost 

vertex  to  SUMMAX[T)    ,   easily  calculated  using  the  p/s.  If  there  exists 

no  such  b  then  wa  labels  the  second  edge  too,  and  we  compute  <J>  by 

a 

^(s^  ,s2>...  ,s|<,wa,p1 ....  ,pk)  =  <f)(s.| , ...  'sa_i»sa"'I'sa+i ....  ,sk,wa,p1-l ,...  .Pk-1) 

+  Ma) 

where    i^(a)     is  as  above. 

The  number  of  choices  for  the    s.'s  is     0(n  ),  and  so  is  that  for  the 

2k 
p. 's.     Hence  the  function  <j>  is  computed  in     0(n     )     points,  where  each  calcu- 

2k+l 
lation  is  at  most    0(n),  therefore  the  algorithm  runs  in  time     0(n         ). 


6.6  OPEN  PROBLEMS 

We  mention  three  open  problems  which  arose  while  working  on  these 
labeling  problems  (the  first  two  of  these  open  problems  are  mentioned  in  the 
text): 

1.  Given  a  graph  and  a  set  of  zero-one  weights,  to  determine 
whether  there  exists  a  labeling  with  diameter  <  1. 

2.  Given  a  path  P   and  a  set  of  weights  W,  to  find  a  labeling 
that  maximizes  SUMMAX{J) . 

3.  Given  a  tree  T  and  a  set  W  of  a,b  weights,  to  find  a  labeling 
that  maximizes  SUMMAx{J)     (this  is  an  interesting  special  case 
of  the  open  problem  of  maximizing  SUMMAX(l)   for  general  T  and  W) 
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